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LIGAND BINDING VARIABLE DOMAIN (V-MIN) COMPRISING A FRAMEWORK REGION WITH 
A CYCLICALLY PERMUTED CENTRAL /6 -BARREL 

The present invention relates to ligand binding variable 
domains (V-mins) having structural rearrangement of the binding domain 
of immunoglobulins and other ligand binding members of the 
immunoglobiilin superfamily. 

An immunoglobulin consists of a pair of light rpHa-iry p (each 
light chain, L, con^jrising approximately 220 residues) and a pair of 
heavy chains (each heavy chain, H, comprising approximately 400 
residues) . Each chain contains domains of approximately 100 residues 
in the form of S sheets. (Kabat, E.A., Wu, T.T., BilofsJcy, H. , 
Reid-Milner, M. , Perry, H. , 1383, Sequences of Proteins of 
Immunological Interest, Public Health Service N.I.H. Washington DC) . 
The sequence variable N- terminal domains in the light and heavy chains 
(v^ and V^) associate to form a substructure (Pv) which contains the 
antigen recognition site. and domain fragments isolated by 

proteolysis (Hochman, J., Xnbar, D., Givol, D. Biochemistry, 1973, 12, 
1130 - 1135) or recombinant methods (US 4,642,334 from ONAX 
Corporation) have been shown to fold and combine to form an Pv 
fragment which retains antigen binding activity. 

The advent of genetic engineering techniques has opened up 
the possibility of engineering antibodies, but since a different gene 
codes for each of the polypeptide chains of an antibody it is 
necessary to clone and express a gene for each polypeptide chain. 
These genes may be either expressed in separate hosts, in which case 
the resulting polypeptide dhains from each host must be allowed to 
refold in solution and combine, or the genes may be expressed in the 
same host simultaneously in which case folding and association into 
the native structure with biological activity can occur after 
expression. The need to achieve expression . of multiple genes, whether 
simultaneously in the same host or in multiple and different hosts, 
has proved to be problematic and inefficient. The impact of genetic 
engineering on monoclonal antibodies has been reviewed by Winter & 
Milstein in Nature, 349 . 1991, 293-299. 

The Fv fragment represents a known antigen recognition' 
factor comprising only about 220 residues out of approximately 1500 
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residues in the intact antibody, it still comprises two polypeptide 
ctLains. A single chain F^-like molecxile has. been described in US 
patent 4,946,778 of Genex Corporation in which a single peptide linker 
sequence foreign to the native Fv fragment is introduced to connect 
the C- terminus of one chain to the N terminus of the other chain. 
Three such constructs (TRY59, TRY61 & TRyi04B) having 234-250 amino 
acids were made and tested for binding. Ntmerous references in Genex 
are made to the fact that loss of residues from the native protein is 
to be minimised (e.g. col. 7, lines 23-24; col. 16, lines 45-47, 57) 
presumeably fearing that loss of sequence woxild lead to loss of 
activity and/or stability. However on occasions when modelling a 
single linker does not appear feasible a 2 linker construct is 
suggested. Note the 2 linker approach is only suggested when the one 
linker approach does not work (col. 7, lines 28-29) . One such 2 
linker construct (TRY 40) having 225 amino acids was prepared and 
tested and again the reader is taught to minimise the loss of native 
protein sequences (col. 27, line 6) . An error has been noted in 
Genex' s description of construct TRY 40. The sequence listed in col. 
33, lines 63-64 of Genex is not consistent with the full listing of 
the construct in Pig. 24 therein. The 2 linker construct TRY 40 
described in Genex has lost residues L41-43 and kxos-109 from the 
strands; thus destroying 3 inter-strand H-bonds and one S- bulge. The 
disruption of the strands in the central fi-barrel is problematic since 
it destroys part of the key element for maintaining a robust structure 
in the varicible region. 

The present invention is based on recognising that the 
minimal elements necessary to retain good antigen binding affinity, 
the complementarity deter minin g region (CDRs) and the central beta 
barrel structure for the Pv fragment, may be retained positioned 
appropriately in space by cyclic permutation of the central beta 
barrel structure and leaving the strands of the Z barrel intact. The 
present invention is also independently based on recognising that 
after cyclic permutation of the central beta barrel the framework 
region of the Pv can be pared down through creation of novel linkages 
in the •structure. 

According to one aspect of the present invention there is 
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provided a ligand binding variable domain (V-min) comprising: 

(a) a cyclically permuted central IS-barrel having intact strands; 

(b) outer 1^-sheet segments; 

(c) complementarity determining regions (CDRs) ; and 

(d) linker sections for linking integers (a) , (b) & (c) into a V-min, 
whereby said central fi-barrel, optionally in combination with said 
outer fi-sheet segments, orients said CDRs in space for binding ligsmd. 

The term "central fi-barrel" means the 8 stranded cylindrical 
barrel formed (at the VL-VH interface) by packing of two IS- sheets (see 
Figure 3 in Novotny et al,, 1983, Journal of Biological Chemistry, 
258., 14433-14437) . Note that Figure 3 in Novotny denotes the barrel 
as 9 stranded compared with 8 strands in our model (see Figure i) but 
the inclusion of an extra strand (5 strands from the heavy chain 
rather than 4) is in o\ir view incorrect due to it not representing a 
true strand because of insufficient H-bonding to any of the 8 
principal strands. Elements of the extra strand in Novotny are more 
correctly denoted as outer C- sheet segments (see below) . Preferably 
the linker sections provide covalent linkdLng. 

The term "cyclically permuted central E-barrel" means that 
the linear amino acid sequence order of appearamce of strands in the 
central S-barrel is permuted compared with the linear amino acid 
sequence order of appearance of strands in a parent central fi-barrel 
of an antibody. 

The term "having intact strands** means that the individual 

strands are mostly complete and undisrupted whereby interactions such 

as H-boiids between strands of the central fi-barrel are generally 

maintained despite cyclic permutation of the barrel. The following 

(canonically numbered) sequences define the strands: L33-40; Ij41-50, 

L84-90, 1.97-104; H33-41; H42-52; H88-95 and H102-109. Preferably at 

most one inter- strauxd H-bond is lost and especially no inter- strand 

H-bonds are lost. Conveniently no fi-b\ilges are lost from the strands. 

Prefersibly at most 3 amino acids are lost from the strands, more 

preferably at most 2 amino acids are lost from the str£uids, more 
♦ 

preferably at most 1 amino acid is lost from the strands and 
especially the strands are completely intact. 
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The term "outer £-sheet segments" means segments of fi-sheet, 
not being part of the central fi-barrel of an. antibody, which 
participate with the central fi-barrel in providing a framework for 
orienting CDRs'for binding ligand. 

The term "CDRs" means the hypervariable polypeptide loops 
which together determine ligand binding specificity and affinity; said 
CDRs are attached to a central IS-barrel and/or outer fi- sheet segments. 
Preferably there are 4-6 CDRs, more preferably 5-6 CDRs and especially 
there are 6 CDRs. Preferably the V-min variable domain is derived 
from an antibody although other ligand binding members of the 
immunoglobulin superfamily such as T-cell receptors are also 
contemplated. 

Preferably the CDRs substantially reproduce the spatial 
o^i^tation of CDRs from a parent antibody and substantially retain 
the ligand binding specificity and affinity of the parent antibody. 

The term "linker sections" includes sequences from any 
protein (for example foreign sequences) such as for example those on 
the Brookhaven Protein Data Bank (abbreviated PDB herein: the relevant 
address is Protein Data Bank, Chemistry Department, Building 555, 
Brookhaven National Laboratory, Upton NYX1973, USA and; a journal 
reference is Bernstein, P.C. et al . J. Mol. Biol., (1977) 112 , 
535-542) . 

The term "ligend binding" includes antigen- antibody type 
binding recognition and also ligand binding in the sense of catalytic 
functions of antibodies (see Jacobs, J.W. in Bio/Technology 2= 1991, 
258-262; and Tramontane, A. in Methods in Enzymology 178 . 531-568) . 

Preferably in the V-min the framework integers (a) , (b) and 
(d) p^y se iiave upto about 180 amino acids. The framework region is 
defined as any part of the variable domaim other ^Han the CDRs. 

Preferably the central IS-barrel and outer S- sheet segments 
are from the same antibody (i.e. parent antibody) which may or may not 
be the parent antibody from which the CDRs are derived. in some 
cir exams tances it is advantageous to reduce immunogenicity, for exanple 
for therapeutic use in htimans. To date human antibodies have proved 
difficult to produce for therapeutic use, therefore alternative 
approaches have been sought. One such approach is to "humanise" 
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antibodies of non- human origin. For example CDRs from a nioiise parent 
antibody may be grafted onto a human parent antibody framework region 
using known techniques (see EP 239400) thvis reducing antigenicity in 
humsuis compared with the complete mouse parent. Another example is to 
graft a complete mouse variable region onto a htiman constant domain 
(see US 4 816 S67) . Another option is "antibody veneering" where 
surface residues of an antibody are altered to mimic a htiman antibody 
(George Mark presentation at 2nd Annual IBC Conference on Antibody 
Engineering, December 16 - 18, 1991, San Diego, USA and: £P5l35d6 , 
Merck/NIH, -Padlan et al) . 

According to another aspect of the present invention there 
is provided a ligaind binding variable domain (V-min) comprising: 

A) a framework region comprising: 

(a) a cyclically permuted central ^-barrel; 

(b) outer beta-sheet segments auid; 

(c) linker sections-; and 

B) complementarity determining regions (CDRs) ,- 

whereby said central IS-barrel, optionally in combination with said 

outer sheet segments, orients said CDRs in space for binding ligand; 

the linker sections provide lixiking between A) & B) to form the V^min, 

in which the framework region per se has upto about 180 amino acids. 

Preferably the central beta barrel has intact strands . 

Preferably the V-min possesses an affinity for ligand of at 

at least 10E-5M; more preferably the V-min possesses an affinity for 

ligand of at least 10E-6M; more preferably the V-min possesses an 

affinity for ligand of at least 10E-7M, more preferably the V-min 

possesses an affinity for ligand of at least lOE-SM, more preferably 

the V-min possesses an affinity for ligand of at least 10E-9M, more 

preferably the V-min possesses an affinity for ligand of at least 

lOE-lOM and especially the V-min possesses an affinity for ligand of 

at least lOE-llM. In general, when a V-min is based on a parent 

antibody, the V-min preferably has an affinity for ligand at least 

about the same as that of the parent antibody. Modification of V-min 

to give increased affinity compared with a parent antibody is 
« 

contemplated. 

Preferably the V-min has less than 2 covalent linkages 
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between tlie heavy and light chains. 

Preferably there is provided a V-min as hereinbefore 
described in the form of one continuous polypeptide. This aspect 
simplifies recombinant production of the V-min. However chemical 
synthesis of V-mins is also contemplated. 

Preferably there is provided a V-min as hereinbefore 
described comprising the following core segments: light chain residues 
21-55; light chain residues 84-104; heavy chain residues 21-60 and; 
heavy chain residues 88-109. This aspect has the advantage of 
preserving the tight IS- turns in the heavy and light chains around 
residue 40. 

Preferably there is provided a V-min as hereinbefore 
described wherein the linker sections essentially consist of sequences 
from the antibody used as the source of the central S-barrel. This 
aspect has the advantage of substantially avoiding the introduction of 
foreign sequence into the V-min. Thus the modelling process is 
simplified since conformation of foreign linkers does not have to be 
modelled; furthermore immunogenicity is reduced in the host species 
used to provide the antibody framework. 

Preferably there is provided a V-min as hereinbefore 
described, having less than about 180 amino acids in the framework 
regions. Preferably the V-min has less than about 175 amino acids, in 
the framework regions, more preferably the V-rain has less than about 
170 amino acids in the fraunework regions, more preferably the V-min 
has less than about 165 amino acids in the framework regions, more 
preferably the V-min has less than about 160 amino acids in the 
framework regions, more preferably the V-min has less than about 150 
amino acids in the framework regions, more preferably the V-min has 
less than about 140 amino acids in the framework regions, more 
preferably the V-min is less than about 130 amino acids in the 
framework regions and especially the V-min has less than about 120 
amino acids in the framework regions. 

Chothia C. and Lesk A.M. in Journal of Molecular Biology, 1987, 126, 
901-917 define the canonical CDRs as : 



.9412G25A2 J > 



wo 94/12625 



7 



PCT/GB93/02375 



L26-32 L50-52 L91-96 
H26-32 H53-55 H96-101 

which means that the canonical Framework regions are: 

Ll-25 Ii33-49 L53-90 L97-107 Fv 91 residues 

Hl-25 .H33-52 H56-95 H102-112 Pv 96 residues 

total 187 residues 

This compares with one exemplified canonical V-min framework described 
in the examples set out below: 

H87-95 H102-109 H15-25 H33-52 H56-H64 Ll-7 L21-25 L33-49 Ii53-63 L79-90 

9 8 11 20 9 7 5 17 11 12 

L97-104 

a 

total 117 residues 

i.e. a reduction of 37% (calculated from a reduction of 70 residues 
from the original 187 residues) . Molecular Design and Modelling 
Concepts and Applications are described in Methods in Enzymology, 
volumes 202 and 203, Academic Press. 

Mini m ally sized antibodies are desirable for certain 
applications such as for exan^le where improved tissue penetration or 
fast clearance from the body is important (Friedman, P.N. et al.. 
Cancer Research 53, 334-339: 1993) . Iii5}roved tissue penetration is 
important in imaging applications and fast clearance from the body is 
important in clearance of toxic drugs from the circulation and in 
antibody directed enzyme prodrug therapy. Ward et al . in Nature, 341 . 
1989, 544-546 have described the binding activity of single VH domains 
called dAbs. DAbs have the disadvantage of only being able to present' 
3 CDRs and exposure of hydrophobic amino acid residues to polar 
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environments gives rise to handling problems sucli as non-specific 
binding. 

Further advantages of size minimalisation are that simpler 
structures may lend themselves to more specific chemistry or a wider 
range of chemistries during for example conjugation and the 
simplest/most compact structure may be more amenable to 
oligomerisation of a series of binding elements either in a homo or a 
hetero oligomer. 

According to another aspect of the present invention there 
is provided a multivalent ligand binding moiety comprising at least 
one V-min as hereinbefore described . 

The term "multivalent" means that one mole of binding moiety 
can bind more than one mole of ligand; said ligand comprising one 
ligand (i.e. monospecific) or more than one ligand (e.g. bispecific) . 
Preferably the multivalent ligand binding moiety is bivalent. For 
example one mole of bivalent ligand binding moiety can bind 2 moles of 
ligsuad and conprises either 2 ligand binding V-min riri ma-iT^^ ©f the 
present invention or one ligand binding V-min domain of the present 
invention and one other ligand binding moiety such as Fab, Fv or 
single chain Fv. 

According to another aspect of the present invention there 
is provided a conjugate comprising a V-min as hereinbefore described 
or a multivalent ligand binding moiety defined as heredLnbefore 
described and an effector molecule. 

The term "effector molecule" refers to a molecule capable of 
producing an effect or a signal. Preferably the effector molecule is 
an enzyme, catalytic antibody, toxin, drug or radionuclide and 
especially the effector molecule is an enzyme or catalytic antibody 
for antibody directed enzyme prodrug therapy (i) . The effector 
molecule may be covalently and/or non-covalently linked to the v-min. 
Known techniques for covalent linkage include heterobifunctional 
linkers and protein fusion. A known technique for non- covalent 
linkage is via leucine zipper motifs and the like (Kostelny, S .A. et 
al., J. Immunol, 14B 1992, 1547 -- 1553 and Pack, P. et al , , 
Biochemistry 31 1992, 1579-1584) . In some cases a mixture of both 
covalent and non-covalent linkage is used. For example a polylysine 
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moiety may be covalently linked to a V-min and the positive charge on 
the polylysine moiety may non-covalently link a complementary charged 
moiety such as nucleic acid. Another example is covalent linkage of a 
chelate to a V-min; the chelate then non-covalently binding a moiety 
such as a radionuclide. 

According to another aspect of the present Invention there 
is provided a polynucleotide sequence encoding a V-min as hereinbefore 
described or a multivalent ligand binding moiety as hereinbefore 
described. 

According to another aspect of the present invention there 
is provided an expression vector comprising a polynucleotide sequence 
as hereinbefore described. The expression vector may direct secretion 
of translated polypeptide chain for facilitating folding of the chain 
into a ligand binding conformation. Conveniently ^e expression 
: vector directs expression of translated polypeptide for intracellular 
accumulation, especially as inclusion bodies. 

According to another aspect of the present invention ^here 
is provided a host cell comprising an expression vector as 
hereinbefore described. Preferably the host cell is prccaryotic and 
especially the host cell is E. coli . 

According to another aspect of the present invention there 
is provided a method of producing a V-min as hereinbefore described or 
a multivalent ligand binding moiety as hereinbefore described 
comprising culture of a host cell as hereinbefore described. 
Methods for production and analysis of single chain antiibody molecules 
as well as general antibody engineering techniques have been set out 
in the following publications : 

1. BEDZYK, W.D. : J. Biol. Chem. , (1390), 18615-18620 

2. HUSTON, J.S.: Proc. Natl. Acad. Sci., (1988), 85., 5879-5883 

« 

3. BIRD, R.E.: Science, 242 , (1988), 423-426 

4. WHITLOW, M. : Methods, A Companion to Methods in Enzymology, 2.(2), - 
(1991) , 97-105 
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5. WARD, E.S.: Anti±>ody Engineering, A Practical Guide, C.A.K. 
Borraebaeck (Ed), W.H. Freeman New York (1992), Ch 6: 121-137 

- • ■ 

6. ANAND, N.N. : J. Biol. Chem. , 266(32), (1991), 21874-21879 

7. SKERRA, A.: Current Opinion in Immunology, 5,, (1993), 256-262 

8. PLUCKTHDN, A.: Bio/ technology, 9, (1991), 545-551 

9. GLOCKSHDBER, R, : Biochemistry, 29, (1990), 1362-1367 

10. SKERRA A. : Bio /technology, 9 (1991) 273-278 

11. PLUCKTHON,^ A.: Nature, 347 . (1990), 497-498 

12. FORD, C.F.: Protein Expression & Purification, 2, (1991), 95-107 

13. PIjUCKTHDN, A- : Immunological Reviews, 130 . (1992), 151-188 

14. HOLLINGER, P.: Current Opinion in Biotechnology, 4 (1993), 

446-449 

15. BRINKMANN, U: Proc. Natl. Acad. Sci. USA, 90 (1993), 7538-7542. 
16 . Methods in Enzymology , vol 178 . 

Analogotas methodology can be applied to V-mins described 
her-ein. Inspection of databases has indicated over 100 publications 
at present on single chain antibody molecules covering many different 
antibodies thus indicating the general applicability of modelling 
concepts across this family of molecules. 

According to another aspect of the present invention there 
is provided a pharmaceutical composition con^rising a conjugate as 
hereinbefore described an active ingredient in association with a 
pharmaceutical excipient or carrier. Conveniently the composition 
coTuprises an effective amount of the active ingredient . 



9412e2SA2 I > 



wo 94/12625 



11 



PCT/GB93/02375 



According to another aspect of the present invention there 
is provided a radioimaging composition comprising a conjugate as 
hereinbefore described as an radioimaging ingredient in association 
with a pharmaceutical excipient or carrier. Conveniently the 
composition comprises an effective amount of the radioimaging 

The reader is referred to Comprehensive Medicinal Chemistry, 
Volume 5 (Biopharmaceutics) , Hansch (Editor), Pergamon Press 1990; 
especially chapters 25.2 (formulation), 25.3 (Dosage Regimes) euid 25.5 
(Drug Targetting) . Section 25.5.5.1 therein discusses immunotoxins 
and Section 25.5.5.3 discusses antibody- radionuclide conjugates for 
imaging and radiotherapeutic purposes . Radioimaging is described in 
Larson (1987) Radiotherapy 165 . 297. A reference work on toxins is 
"Genetically Engineered Toxins" edited by A.E. Frankel, Marcel Deleter 
Inc. 1992. Immunotoxin Preparation is discussed in chapters 20 and 21 
of Methods in Enzymology, Vol 93, Academic Press 1983- 

According to another aspect of the present invention there 
is provided a method of treatment of a tumour comprising the 
administration to a human or animal in need of such treatment of a 
pharmaceutically effective amount of a conjugate as hereinbefore 
described. 

According to another aspect of the present invention there 
is provided an immunotoxin phaxmaceutical composition comprising a 
V-min as hereinberore defined in association with a pharmaceutical 
excipient or carrier. 

According to another aspect of the present invention there 
is provided a radioimaging composition comprising a V-min as 
hereinbefore defined in association with a pharmaceutical excipient or 
carrier. 

» 

According to another aspect of the present invention there 
is provided an antibody- enzyme pharmaceutical composition for ADEPT 
comprising a V-min as hereinbefore defined. Many cytotoxic conrpounds 
have been discovered which are of potential use in cancer 
chemotherapy. Nitrogen mustards forra one important family of such 
cytotoxic compounds. The clinical use of cytotoxic compounds in 
general and nitrogen mustards in particular has been limited because 
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of the poor selectivity in the cytotoxic effect between tumoxor cells 
and normal cells. One approach to overcome this problem has involved 
the development of so-called pro -drugs which are derivatives of the 
cytotoxic drug, often a relatively simple derivative, whose cytotoxic 
properties are considerably reduced cott^^ared to those of the parent 
drug. Proposals have been made for the administration of such 
pro-drugs to patients under regimes whereby the pro-drug is only 
converted to the cytotoxic drug in the region of the intended site of 
action- One approach involves linkage of a cytotoxic parent nitrogen 
mustard with cm amino acid to form a pro-drug which can be converted 
to the parent nitrogen mustard at the site of intended action under 
the influence of an enzyme. This approach can be put into practise by 
the utilisation of an antibody/ enzyme conjugate in association with a 
pro- drug. The antibody/ enzyme conjugate is formed from an antibody 
selective for tumours and an enzyme that will convert the pro-drug to 
the cytotoxic drug. In clinical practice, the antibody/ enzyme 
conjugate is first ad m inistered to the patient and is allowed to bind 
to the txxmour. After a suitable period of time, to allow clearance of 
the antibody/ enzyme conjugate from the rest of the body, the pro-drug 
is administered to the patient. Conversion of the pro-drug, tinder the 
influence of the localised enzyme, to the cytotoxic drug takes place 
mainly in the region of the tumour. Such a system (Antibody Directed 
Enzyme Prodrug Therapy or ADEPT) is described in International 
Application PCT/GB88/00181 pxiblished as WGS8/07378 and US Patent No 
4,975,278. 

According to another aspect of the present invention there 
is provided a method of treatment of a tumour comprising the 
administration to a human or animal in need of such treatment of a 
pharmaceutically effective amovint of a pharmaceutical composition for 
ADEPT or oui imimmotoxin pharmaceutical composition. 

According to another aspect of the present invention there 
is provided use of a radioimaging composition as hereinbefore defined 
to produce a radioimage as sm aid to diagnosis of a condition in a 
htoman or animal. 

The invention is illustrated, but not limited, by the 
following examples in which: 
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Figure 1 illustrates the Beta Strand Arrangement of Inununoglobulin 
Variable Regions in which structurally important residues Cysteines 
(L23-L88, H22-H92) indicated by C and Tryptophans L35 and H36 are 

» 

represented by W. The upper section illustrates the outer iS-sheets 
for each chain. These sheets overlay the inner sheets shown in the 
lower section of the diagram. Folding along the horizontal crease 
yields the fi-sandwiches for the two chains. Juxtaposition of the 
inner sheets (fold along the vertical crease) . generates the central 
i^-barrel (highlighted) . Folding along the horizontal crease and then 
. rolling the sides of the paper to meet each other generates the inner 
central barrel surrounded by outer strsuids etc. 

The dotted lines connecting the strands represent hydrogen bonding 
between strands; straight dotted lines are H-bonds formed between 
antiparallel strands and wavy dotted lines are H-bonds formed between 
parallel strands. 

Figure 2 illustrates the core segments as emboldened sequence. 

4 

Figure 3 illustrates the Beta Strand Arrangement of Immunoglobulin 
VarisUale Regions the V-min Construct 4m in which: 

I strands above paper 

: iS- strands behind paper 

. vertical linkers 

- horizontal linkers 

# non- sequential links for V-min 

The emboldened sequence is the V-min construct. 

Figure 4 illustrates alignment of FAB light chains for F19.9 & IMCP 

Complementarity Determining Loops are underlined. 
Canonical Structure Type (reference 4) is indicated alphabetically 
Framework residues important to Loop conformations are indicated by 

* 

Segments used in the V-min design are demarcated by < >. 

The order of appearsuice in the V-min sequence is indicated by s#. 
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Figure 5 lists: (a) amino acid sequence for F19.9 V-min constructs; 
(b) amino acid sequence for IMCP V-min. 

* 

Figure 6 lists the initial nucleotide sequence for F19.9 V-min. 

Figure 7 illustrates a V-min construct in bold line with redundant 
sequence in faint line. 

Figure 6 shows the a-carbon diagram of a final V-min structure. 

Figure 9 shows a V-min conqpared with a Pv structtire. Faint line on 
the Fv represents sequence discarded to form the V-min. The bottom 
left comer of the V-min represents TAG sequence added. 

Figures 10, lOA & 11 show that the environment of possible problematic 
polar groups in V-min structures have adecpiate bonding. Figures 10 
and lOA show the environments for residues GluSS and Lys97 in F19.9 
V-rain respectively. Figure 11 shows the environments for LyslOl and 
GlnlOS in IMCP V-rain. 

Figure 12 is a diagrammatic representation of Pv, central & barrel and 
minipial V-min constructs. 

Figure 13 illustrates the v^ fi-strand arrangement. 
Figxire 14 illustrates the V iS-strand arrangement. 

Figure 15 illustrates the Fv fi-sandwich arrangement with the heavy 
chain in continuous line and the light chain in broken line. 



Figure 16 illustrates a conservative V-rain construct including 
redundant -segments . 

Figure 17 illustrates the conservative V-rain construct of Fig 16 but 
excluding redundant segments . 

Figure 18 illustrates a minimal V-min construct including redxindant 
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segments . 

Figure 19 illustrates the minimal V-min construct of Fig 18 but 
excluding redundant segments. 

Figure 20 illustrates construction of a MCP603 v-min gene from 
oligonucleotides . 

Figure 21 is the peptide sequence of a MCP603 V-min construct 
including c-myc peptide tag. 

Figure 22 illustrates pBluescript. 

* 

Figure 23 illustrates pMIN603. 

Figiire 24 illustrates construction of pICI0020. 
Figure 25 illustrates a trp promoter. 
Figure 26 illustrates construction of pTB344. 
Figure 27/21 A illustrates construction of pXCI0042. 
Figure 28 is a transcription terminator sequence. 
Figure 29 illustrates construction of pLB013 . 
Figure 30 illustrates pICI123 . 
Figure 31 illustrates pICI1079. 

Figure 32 illustrates construction of pXCX1572. 

Figure 33 illustrates construction of a F19.9 V-min gene using 
oligonucleotides. 
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Figiire 34 illuscrates a F19.9 v-min peptide sequence. 

Figure 35 shows the sequence of pSWl-VHD1.3-VlCD1.3-TAGl from the start 
of the first pelB leader sequence to approximately 50 bases beyond the 
end of the TAGl. Numbered from the start of the pelB. Amino acid 
sequence of the pelB-VHDi.3 and pelB-VKDl.3-TAGl genes are shown below 
^he DNA sequence. 

Figure 36 shows PGR generation of Dl.3 V-min 

Figure 37 shows products from first round PGR reactions 

Figure 38 shows a plasmid map of pICI226 

Figiire 3 9 shows a plasmid map of piJFH3037 

Figure 40 shows Dl.3 V-min conservative 

Figure 41 shows h65-115/hl3-64/ll.i08 Dl.3 V-min with Intact Light 
Chain 



Figure 42 shows 161-107/113-60/hl-116 Dl.3 V-min with Intact Heavy 
Chain 



Figure 43 shows h87-llS/hl3-64/ll-7/121-61/l76-108 Dl.3 V-min 
minimal Construct 

Figure 44 shows h67-l25/hl3-66/ll-li2 KOL V-min with Intact Light 



Figure 45 shows l62-109/112-61/hl-125 KOL V-min with Intact Heavy 
Chain 



Figure 46 shows h66-118/hl3-65/ll-li2 PAC-1 V-min with Intact Light 
Chain * 
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Figure 47 shows 166-lll/112-65/hl-123 PAC-1 V-min with Intact Heavy 
ChadLn 

Figure 48 shows binding data for a Dl«3 V-min 

Table 1 lists Ca-Ca distances for linker sections of the MPC Fv 
fragment 

a) Ii55-84 and L104-107 versiis Ll-21 

b) H60-88 versus Ll-21 

c) H60-88 and H108-113 versus Hl-21 

d) LS5-84 versus Hl-21 

e) L55-84. versus L104-107 and H60-88 versus H108-113 

f ) Intra Section relationships 

Table 2 lists oligonucleotides used in gene assembly 
K3CAMPIiR 1 MOngr.T.TNG STDDZSS 

V-min proteins have been designed, modelled and amalysed. 
One is based on a known (2) FAB X-ray structure, IMCP (also referred 
to herein as McPC603) , a mouse monoclonal which binds phosphatidyl 
choline. Another is based on F19.9 which lacks an experimental 
structure for the FAB fragment. We have therefore generated a FAB 
model for F19.9 by analogy with the known structure of IMCP. This 
homology modelling approach (3) is a well established method for 
generating structural models of proteins that exhibit more than 30%. 
identity to a protein whose 3D structoire has been determined 
experimentally. The methodology set out below can be applied to any 
antibody and was \ised to generate the V-min structures in the examples 
herein . 

1 • X Background 

The variable Fv region of the FAB structure forms the basis for 
designing the V-min structure and, in the general case, a structural 
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model of the Fv region is generated by homology modelling based on the 
most closely analogous known stiructure. 

As indicated earlier, the central eight- stranded fi-barrel of the Pv 
structure is retained in the V-rain design. In the Fv structure, four 
IS-strands are contributed by the light chain, and four by the heavy 
chain. 

Four of the six complementarity determining regions (CDR's) appear as 
apical loops connecting these fi-6trands (L3, L2, H3 & H2 in Fig i) 
The remaining two CDR's connect the N- terminal segments of each chain 
to the first barrel S-strand in that chain (Ll & HI in Fig i) . 
The arrangement of fi-strands in the variable regions of immunoglobulin 
light and heavy chains is shown in Figure 1 using the canonical 
numbering of Chothia and Lesk (4) . 

In Figure i, the upper section illustrates the outer 15-sfaeets for each 
chain. These sheets overlay the inner sheets shown in the lower 
section of the diagram. Folding along the horizontal crease (dotted 
line) yields the fi-sandwiches for the two chains. 

of the inner sheets (fold along the vertical crease) 
generates the central B-barrel (highlighted) . 

Figures 13, 14, and 15 also illustrate the arrangement of IS-strands. 
1-^2 The Minimal Recogxiition Structure 

In addition to the central £-barrel, four segments of fi-strand 
(1*19-25, L53-55, H18-25, HS6-60) are required for the generation of 
CDR's-I and II in both chains. 

Residues on either side, in sequence terms, of the CDRs are anchored 
in the central fi-barrel. These three segments per chain are defined 
below: 

Light Chain 21-39, 42-55,. 84-104 
Heavy Chain 20-40, 43-60, 88-109 

In order to avoid disrupting the two tight fitums at L.3 9-L42 and 
H40-H43 we have adopted the following four core segment model (see 
Figure 2) : 
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Two core segment: s from the Light Chain 

L21-S5 ie light chain residues 21-55 (contains CDR's LI- and LII) 
L84-104 ie light chain residues 84-104 (contains CDR LIU) 

Two core segments from the Heavy Chain 

H21-60 ie heavy chain residues 21-60 (contains CDR's HI and HII) 
H88-109 ie heavy chain residues 88-109 (contains CDR HIII) 

1.3 Single Chain Constructs 

In order to produce a single chain species, these four core segments 
are linked together using spatially appropriate sections from the 
intact structure. These linker sections are indicated below (NB lower 
case is used to differentiate linker sections from the core segments) . 



Section 


Chain 


Residue Hixmbers 


11-20 


Light 


1-20 


156-83 


Light 


56-83 


1105-107 


Light 


105-107 


hl-20 


Heavy 


1-20 


h61-87 


Heavy 


61-87 


hl09-113 


Heavy 


109-113 



(NB non-natural, that is foreign, linker sections could also be 
designed to do this such as for example xising sequences extracted from 
the Brookhaven data bank as outlined in Genex US4946778) 

The cyclic nature of the central S-barrel means that the four core 
segments can be combined In different orders ■ For example, by linking 
the C- terminus of a given segment to any one of the N- termini from the 
three other segments. There are thus potentially 24 different 
combinations. Their feasibility can be determined by means of the 
spatial relationships of the available elements of the intact 
structure (ie the linker sections) that can be used to join the core 
segments. The adjacency of these linker sections can be evaluated in 
terms of the distances between the a-carbons of the residues involved 



wo 94/12625 



PCT/GB93/02375 



- 20 - 



. (see Table 1) 



These distance tables can be used to produce an adjacency matrix of 
possible linkages. If any a-carbon in one linker section is 
sufficiently close to any of another we define the sections as 
adjacent- The cut-off distance that we have used is ISA, since this 
allows for the reorientation of up to six residues, in order to create 
a contiguous structure. In the following table, + indicates sections 
that are adjacent by the above definition, - indicates sections that 
are too distant to connect and 0 represents self with self. 



11-20 



156-83 1105-107 



hl-20 h61-87 



hl09-113 



11-20 

156-83 

1105-107 

hl-20 

h61-87 

hl09-113 



0 
+ 



0 
+ 



+ 
+ 
0 



+ 
0 
+ 



We can enumerate the possible combinations of linker segments via the 
adjacency diagram shown below, where double-headed arrows indicate two 
adjacent sections. 



107-105 

/ \ 
/ 64-104 

/ \ 
1-20. 21-55 . 56-B3 

I 
I 

I I 
V V 

87-61> 60-21 .20>l 

\ / 
88-108 / 

\ / 
109-113 



Light Chain 



Core Segments are. underlined 



In the following, we use example residues to illustrate possible 
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connections. Residues in the core segments (upper case) and linker 
sections (lower case) are separated by Wh^re segments or sections 
are separated by : the sequence is continuous as for the native 
protein. Where / is used a novel linkage between segments is formed. 

The lack of a direct connection between 1105-107 and hl-20 (or 
hl09-ll3 and 11-20) means that the native sequence order can only be 
preserved by use of a foreign linker section ( ! ! ) : - 

11-20 :L21-55:l56-83:L84-104:1105-107/U/hl-20:H21-60:h6l. 87 :H88-108:hl 
09-113 



or vice versa. 



Some of the proximal distances are non-productive in that they would 
connect a single segment in a closed fashion. For example; the 
distance between light chain residues 20 and 74 is 4A but this merely 
connects linker sections 156-83 to 11-20, thus enclosing the core 
segment L21-55. 



11 20:Ij21 55 : 156 



83 



74 



Exploiting the adjacency diagram, in£er-chain connections involve: 

11-20 < > h61-87, or hl-20 < > 156-83 

and the following four combinations may be possible : 

1. L21-55: 156-83 ;L84-104/ ll-20/hgl-Q7 :HSa-lQB ; hiQQ, i ti />,i --^a /^^l 60 

■ • 

2. H21-60:h6l-87 :H88-108/ hl- 20/156-83 :LB4-1Q4 : HQS- 1 n7/1 i -on /^-^o-^ , 55 

3. L84-104: 1105-107/11-20 ;L21-55; l56-83/hl-2Q =H21-6Q.hgT .trofl--, no 
hl09-113 

4. H88-lQ8;hl09-113/hl-20:H21-60: h61-B7/ll-2Q >L21-55n :L84-104 
1105-107 
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in choosing specific residues from the linking sections to generate 
the connected sequences, we make use of the distance tables to 
identify candidates. 

in co^^styugt l , residues from the linker sections I1-20 and h61-87 are 
to be used to connect core segments L84-104 and HB8-108. Analysis of 
the distance tables suggests that light chain residue l can connect to 
heavy chain residue 61, and that light chain residue 104 could link to 
light chain residue 13. The two candidates from 11-20, ie 13 and l, 
also need to be connected. These two residues are connected in 
sequence 11-13, unfortunately, the direction would need to be reversed 
in this context. 



h61 

I 
I 

LB4 104 11 



>87 



I 

13 20 HB8 — 108 



This process reveals that construct.^ 1 aQd 2 are unlikely unless 
linkers (U) are used. 



X. L21-104/Lil/ll/ii/H61-113/H13-65 
2. H21-108/H13/U/21/LS7-107/L13-60 



Constructs 3 ang 4 appear to be viable and represent generic 
constructs that produce single chain species in which the sequential 
order of the CDR's is changed and novel linkages using native residues 



in coffsi^rvct 3^ the sequential order of the light chain is reorganised 
to provide a connection to the intact heavy chain, whilst in constru.^ 
4 the heavy chain is reorganised and the light chain preserved. 
Choosing specific connecting residues by means of the distance tables 
yields the following exemplifications: 



3a. L83-107/til3-57/Hl-113 
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4a. H87-113/Hi3-64/Ll-107 



.Minimal Single Chain Constructs 

The distance tables indicate that constructs 2 and 4 . which consist of 
approximately 185 residues, may be reduced in size by: 

a) Omitting sequence that spans key segments which are close in space. 

Thus in construct 3, section hl-20:H21-60 can be replaced by 
hi-6/H2i-60 thereby saving fifteen residues- 



I I 

I I < omitted sequence 

I I 

I |20 

. . H21 60 



hi 



b) Splicing those candidate junctions which minimise the number of 
residues needed. 

. Thus in construct 4, hl09-ll3/hl3-20 can become hl09/hl5-20, a 
saving of six residues . 



I ^ I 
hlOS-- --113 

/ : 
20-< < hl3 

15 

Such considerations lead to the following minimal single 
constructs of about 150 residues in which similar numbers of residues 
are contributed by the originating chains. 

3m. L83-106/L1S-57/H1-6/H21-60/H88-113 
4m. H87.109/H15-64/L1-7/L21-63/L79-104 

We now further illustrate the approach by .adopting construct 4m 
(Figure 3) for further modelling analysis . 

1-4 Alignment and V-min constructs 
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The alignment of F19.9 with IMCP was performed within QUANTA (5) and 
topological consequences were checked using computer graphics 

The F19.9/1MCP alignment is shown in Figure 4 and the V-min constructs 
are as follows : 

si s2 S3 s4 s5 

Canonical H87.H109/H15-H64/Ll-L7/L21-L63/Ii79-L104 
IMCP H93-H118/H15-H67/Ll-I.7/L21-Ii69/L85-L110 * 

F19 . 9 H93 -Hlll/HlS-H67/Ll-L7/L21-L63/L79-Iil04 

The resultant amino acid sequences are shown in Figure S with 
associated C-terminal TAG residues (used to aid subsequent detection 
in binding assays) and N-terminal modifications which may result from 
cloning into certain expression vectors. An initial nucleotide 
sequence for F19.9 V-min is shown in Figure 6. 

In Figure 4, the V-min segments, and the CDR's are indicated. In 
addition, certain framework residues which Chothia suggests (4) are 
in^ortant to CDR loop conformations are identified. The V-min 
construct shown above incorporates five of these residues, however 
tiiree are omitted. Of these, L64 Gly (iMCP L70) {important to CDR LH) 
and L71 Tyr (IMCP L77 Phe) (in^jortant to CDR LI) could be restored by 
replacing s4 & s5 with L21-L104 (IMCP L21-L110) . 

Restoration of the remaining residue H71 Arg (H74 in F19.9 and IMGP) 

which is important to the juxtaposition of CDR's HI and .HII coxild be 

achieved by initiating the sequence earlier than H87, eg at H68 (H7a 

F19.9 and IMCP) . Alternatively, it may be possible to mutate eg H49 

{H52 Asn in F19.9 and IMCP) to reinforce the loop positions. 

Further illustration of the v-min concept is provided in Figures 12 - 
19. 



1-5 Modelling the V-mln Proteins 

1.5.1 Introduction 

The f ifrst step is to identify an X-ray structure of a Fab on which to 
base the model. In the case of IMCP the X-ray structure has been 
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determined. In general, the decision is baiaed on the degree of 
identity between the sequence of known Fab s.tructures and the target 
protein. Particular emphasis is placed on the relationship of the 
corresponding CDR loop sequences and their associated conformational 
determinants from the framework (4) . Numerous Pab X-ray structures 
are available in the Brookhaven data base. The IMCP Pab structure 
represents a viable starting point for the development of a homology 
model of F19.9. The Fv portions of the Fab structures provide useful 
reference points for evaluating the feasibility of the V-min models in 
terms of the residue environments etc. 
1.S.2 IMCP V-min 

The regions of the IMCP structure which are not involved in the V-min 
construct were deleted using the QUANTA (5) utility. The remaining 
structure was output as a PDB file which was edited to the proposed 
V-min order. The resulting PDB file was imported into QUANTA as an 
initial V-min structure. 

This structure required remodelling of the junctions between segments 
to create a connected structure. This was undertaken using an 
annealing protocol which can be generalised as follows: 
Fix all atoms other than those in the annealing zone. 
The annealing zone consists of ca four to six residues (eg two on 
either side of the proposed "join"). The size of this annealing zone 
is reflected in the 15 A criterion for "adjacency" used earlier. 
Perform constrained optimisation using CHARMM (6) until residual 
gradient is leas than l Kcal/A 

Check for torsion angle violations (eg cis peptides etc) . 

If necessary, incorporate appropriate dihedral angle constraints and 

re-optimise. 

After all junctions were annealed, the resulting structure was energy 
relaxed in two stages, firstly, all side chain atoms were allowed to 
relax and this was followed by a full energy relaxation. The 
disiaphide distances were constrained in these steps. 

At this stage, the "tag" sequence was inserted after the C-terminal 
residue. The "tag" sequence plus the last two V-min residues formed an 
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annealing zone and the protocol described earlier was followed. 
Additional distance constraints were applied, to bring the non-polar 
residues in the tag sequence into contact. 

A final full energy relaxation was applied and an a-carbon diagram is 
shown in Figure 8; a corresponding Figure 7 shows the V-min construct 
in bold and redundant sequence in faint line. 



1.5.3 



F19.9 V-min 



The proposed amino acid sequence file for this construct + "tag" was 
generated and QUANTA utilities used to map the sequence onto the 
structure of IMCP V-min + "tag" generated earlier. As can be seen in 
Figure 4, two deleted sections with respect to IMCP occur in the CDR 
loops. Thus, two annealing steps were undertaken followed by energy 
relaxation of sidechains and, finally, of all atoms as described 
earlier for the IMCP V-min model. 

Two candidates for the N- terminal modification are indicated in Figure 
5 . The shorter candidate requires a glycine to be replaced by leucine . 
initial inspection of the v-min model indicated . that this could be 
problematic. Thus, the necessary mutations and insertions were 
accomplished using QUANTA and a zone consisting of the candidate 
sequence plus the next residue was annealed followed by energy 
relaxation. 



1.5.4 



Modelling The FXS.9 FAB Structure 



model was required 



construct. The alignment 
mapping the F19.9 
deletion and insertion 
and the side chaxns were 



also as described earlier 



for comparative analysis with the F19.9 V-min 
shown in Figure 4 formed the basis for 

onto the IMCP structure using QUANTA. The 
zones were annealed as described for the v-mins 
energy relaxed, followed by the full molecule 



1.5.5 Analysis of The V-min Models 

The following criteria have been applied in assessing feasibility of 
the models, .the residue numbering is that of the V-mins: 
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1.5.5 Potential 

The energy of the system should indicate a lack of steric clash, and 
internal strain. In the CHARMM force field using a distance dependent 
dielectric (4r) , a general rule of thumb is that energies /residue 
should be in the range -4+1 Kcal/mole. 

Potential Enerorv 
kcal/mole kcal/reg-i Hi.o 

F19.9 V-min + tag -556 .3^^ 

F19.9 V-min/tag/N- terra -543 .3.4 

IMCP V-rain + tag -689 .4^2 

1.5.5.2 Dihedral angles 

Peptide omega angles (C'-N torsion) should be trans (180° +/- 20«) . 
Cis peptides are allowed if they precede proline (7) . 
Main chain phi angles (N-Cof torsion) should be negative for L-amino 
acids. Positive values are acceptable for glycine residues (7). 



Positve 


Phi Anales 






F19.9 V-min 


IMCP V-min 


Comment 




Gly 16 


Gly 23 






Gly 21 


Gly 28 






Gly 31 


Gly 38 






Asn 62 


Lys 69 positive in IMCP Fab Xray 


structure 




Gin 100 + in 


IMCP; deleted in F19. 


9 


Gly 99 


Gly 112. 






Arg 108 


Gly 121 + in 


IMCP but G=>R in F19. 


91 


Gly 115 


Gly 128 






Gly 142 


Gly 155 






Cis Omeaa Anales 






F19.9 V-min 


IMCP V-min 


Comment 




135 Pro 136 


Tyr 148 Pro 149 


cis in IMCP Xray 
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Omega angles outside the range 180«> -y no 



Fig. 9 V-min w 
Asp 72 159 
Asp 123 -152 
Leu 130 159 



IMCP V-min w 



Asp 79 
Asp 136 
Gin 143 



159 
-152 
159 



hys 144 153 Lys 157 154 



Comment: 
junction s2-s3 
junction s4-s5 

junction s5-tag 



Glu 146 156 
He 150 154 



Glu 159 
lie 163 



156 The following are 
154 residues from the 



Ser 151 154 ser 164 154 "tag" sequence and 
Glu 153 152 Glu 166 152 less well modelled 



1.5.5.3 

For folded proteins main chain 
frequency of 60 ± 20 for every lOO 



bonds occur at an average 
(8) . 



F19.9 V-min 
IMCP V-min 
IMCP Fab (Xray) 
F19.9 Fab 



H-Bonds/1 nn 
All Main Chain sheer, 
55 22 
57 25 
59 31 
63 29 



Turns 
22 
21 
15 
18 



1.5.5.4 Bxposed Non-Polar residues 

in general the surface accessibility of non-polar residues should be 
less than half of that of a Gly-x-Gly tripeptide in the extended 
conforn«tion (9, . i„ addition, free energies of hydration for residues 
Should be negative or zero. 

For the present purpose, we report non-polar residues that are exposed 
and have positive hydration free energies (10) . and which also differ 
significantly from the parent Pab model 

■ 
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F19 . 9 


Accessible Fraction 


Hvdration Free 




(kcal /mnl 


1 




V-min 


Fab 


D 


V-min 


Fab 


n 






Val Id 


0.1 


0.0 


0.1 


0.6 


0.0 


0.6 


junction 8l-a2 




Val 53 


0.2 


0.0 


0.2 


0.3 


0.0 


0.3 




m 


lie 56 


0.4 


0.0 


0.4 


0.3 


0.0 


0.3 




V 


Val 71 


0.5 


0.0 


0.5 


0.7 


0.0 


0.7 


junction 82 -s3 




lie 79 


0.8 


0.0 


0.8 


0.1 


0.0 


0.1 


junction s3-b4 




He 106 


0.2 


0.0 


0.2 


0.2 


0.0 


0.2 






IMCP 


Accessible Fraction 


Hvdration Free 


Enercrv 


fkcal/molf*) 






V-min 


Fab 


D 


V-min 


Fab 


D 






Val 19 ^ 


0.5 


0.1 


0.4 


0.7 


0.2 


0.5 


. . 




Val 26 


0.1 


0.0 


0.1 


0.6 


0.0 


0.6 jx 


motion sl-s2 


• 


Val 78 


0.6 


0.0 


0.6 


0.8 


0.0 


0.8 jt 


motion s2-b3 




Leu 95 


0.5 


0.4 


0.1 


0.3 


0.1 


0.2 






Leu 104 


0.1 


0.0 


0.1 


0.2 


0.2 


0.2 






He 119 


0.2 


0.0 


0.2 


0.2 


0.0 


0.0 






1.5.5.5 

> 


Buried 1 


Polar Re 


ai-duee 












If the a< 


ccessible 


surface 


eurea of a pole 


ir residue is le 


ss than 10 A 




(ie that 


of a water molecule) (8) 


, or if 


the fre< 


5 energy 


of hydration 



is positive, the environment must satisfy the hydrogen bonding 
requirements of the residue (11) . 



i 



PMA nano f*1 
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F19 . 9 



9 




Accessible 


Area 




Hvdration 


Free 


EnercrvfkMi /moi 






V-min 


Fab 


D 


V-min 


Fab 


D 




Thr 


7 


0 


0 


0 


0.0 


0 . 0 


0.0 




Asp 


40 


0 


0 


0 


0 - 0 


0 0 


0 . 0 




Glu 


55 


2 


i 


1 


2 6 




9 o 

^ • e 


?? 


mm 


92 


n 
u 


X 


-1 


-0.1 


-0.1 


0.0 




Lys 


97 


56 


38 


18 


0.2 


1.2 


1.4 


?? 


Ser 


101 


6 


9 


-3 


-0.6 


-0.6 


0.0 




Gin 


131 


2 


0 


2 


-0.1 


0.0 


-0.1 




Arg 


137 


5 


2 


3 


-0.5 


-0.1 


-0.4 




Thr 


13B 


0 


21 


-21 


0.0 


0.2 


-0.2 




Thr 


143 


3 


0 


3 


-0.7 


0.0 


-0.7 





IMCP Accessible Area A" Hydration Free Enercrv (kcal/mol^) 

V-min Fab . D V-min 

Asn 9 1 0 0 0.0 -0.1 



D 

0.1 
0.0 
0.0 

Lys 101 54 44 10 0.4 0.0 0.4 



14 0 0 0 0.0 0.0 

Gin 47 0 0 0 0.0 0.0 



77 



Gin 108 9 12-3 -0.4 -1.3 0.9 77 

Gin 143 0 0 0 0.0 0.0 0.0 

Asn 144 0 0 0 0.0 0.0 0.0 

145 2 2 0 -0.2 -0.3 0.1 

T^ 3 0 0 0.0 0.0 0.0 



1 • 5 • 5 « 6 SUBBDAjry o£ 

The potential energies and hydrogen bonding compositions of the V-min 
structures are in the eaqjected range (8) . 

The only dihedral angle that may require further investigation is 
F19.9 V-min Arg 108 0 which is positive and is a Gly in IMCP. 
For both V-min structures, three non-polar residues are slightly more 
exposed than their Fab counterparts. These occur at junctions between 
sequence sections that have been connected in the modelling process. 
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There are also slight distortions in peptide, omega values in these 
regions. The deviations are relatively small and are unlikely to 
present major structural problems. 

Similarly two polar residues have been identified in both V-min 
structures as possible problems, however inspection of their 
environments (Figures 10-11) suggests that the polar groups are 
adequately bonded. 

In terms of these "modelling" criteria, both structures appear to be 
feasible. 



1.6 



Further V-min constructs 



restpration of potentially important framework residues 
was alluded to earlier and coinprises a four segment construct: 

F! 19 . 9 H71-lll/HlS-67/Ll-7/I*21-104 



ii) If a cysteine were to be required at the 'base of the V-min in 
order to facilitate linkage to an effector protein, the presence of 
two. disulphide bridges. (F19.9 6-27, 81-129) may complicate 
purification, refolding etc. The disulphides are relatively buried 
and in an extended conformation which suggest the use of valine or 



• • • % 



As indicated earlier, the core segments required to present 



the six CDRa can be chosen such that six core 



gives rise to additional 
evaluated as shown 



segments are used. This 
combinatorial possibilities which can be 



earlier for the four core segment approach 



i^) In addition it is possible to consider the generation of 

human Fv frameworks based on genetic families or clans (13) . Mouse 

CDR'a can then be grafted on taking into account the requirements for 

particular framework residues to control CDR loop conformation. The 
« 

resultant Fv sequences can readily be convierted to V-min constructs as 
shown earlier. 
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V) Minimal sized antigen combining molecniles could be generated 

as two or more chains . The simplest such molecule would be of the type 



Light Chain 
Heavy rhj^-jn 
of 140. 



I183-IO6/IJ.S-57 
H87-109/H15-S4 



eg 67 canonical residues 

eg 73 canonical residues for a total 



1.7 



Sequence for a consRrvative v-min ha«»ri on D3 t ar^i-^K^^, 



The vmin invention involves the generation of a single-chain Pv-like 
recognition protein by altering the order of appearance of sequence 
segments that display the CDRs. By judicious paring of non-essential 
segments, the size of the protein can be reduced to as few as 160 
residues. For a conservative construct, the aim is to conserve more 
of the original Pv and reduce the new splice junctions, thus resulting 
in a larger protein (for example upto 210 residues) , One advantage of 
the conservative approach is the retention of framework residues which 
may influence CDR loop conformation or participate in induced fit 
responses to binding. With these considerations in mind analysis of 
the x-ray structure for D1.3 Fab resulted in the following 
conservative Vmin constriict : - 



HS5-H11S :HX3-H64 :L1-L108 



The light chain segment remains intact and there are two 
splice- junctions where remodelling is required. A loop, taken from 
pseudoazurin {2paz 34-39) served as a template for the first junction 
and two loops (leucine binding protein 21bp lOS-lis, chymotrypsin 
inhibitor 2ci2 42-48) provided two possible conformations for the 
second junction. Thus two models have been generated, both are 
energetically and geometrically feasible and exactly correspond to the 
original Pv with the exception of the two splice- junction loops and 
the absent N- terminal residues from the heavy chain (HI-H12) . 
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In comparison with the Fv, more non-polar surface is exposed (ca 
,2 

130A ) , and much of this arises from the two leucines associated with 
the remodelled splice- junctions . There may be a case for replacing 
these leucines by alanine or serine to reduce the exposed non-polar 
surface (Figure 40) . Example 4 herein sets out expression of a Dl.3 
V-min construct. 

In sequence number 54 there is indicated on a 01.3 Vmln (conservative) 
an terminal secretion sequence (amino acids 1 to 22) as used by 
Winter et al (Nature 1989 341 544-546) in the successful cloning of Fv 
fragments used for crystallography {J.Mol.Biol. 1990 213 617-619). At 
the C-terminus there is included the c-myc tag sequence (amino acids 
234 to 244) used previously. The changes to the splice junction loops 
are also indicated, but not the possible replacement of the leucines. 



ggCAMPLE 2 GSNBRATZOK OF V-BIIN E3CPRBSSZ0K PLASUZDS 

2.1 Design of an MePC6Q3 V-min Synthetic Gene 

We have designed an MCPC603 V-min synthetic gene based on the protein 
sequence defined by the model. A major consideration was the use of 
E. coli preferred codbns but it is possible to include other features 
without coir5)romising this objective. The gene was designed for 
intracellular expression in E . coli using expression vectors set out 
herein but has inherent flexibilty for cloning into other systems 
such as secretion systems as set out in example 4 herein. Protease 

strains of E . coli may be considered for es^ression. 



2.1.1 Preparation of a V-min synthetic gene from antibody McPCS 03 

A DNA seq[uence (Figure 20) encoding the amino -acid sequence of the • 

« 

polypeptide of Figure 21 Pi which is based on the modelled structure 
« 

for V-rain was designed with the following features : 
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Codons normally chosen as those mo^t commonly used in highlv 
expressed E . coli genes. 

2) A series of restriction endonuclease sequences flanking and 

throughout the gene to facilitate f lexibilty in cloning and subsequent 
genetic manipiilation. 

3^ * Translation initiation and termination codons . 

^) A sequence deisigned to minimise potential secondary 

structure within both component oligonucleotides and predicted 
messenger RMA. 

5) Single-stranded cohesive termini to allow ligation at 

suitable sites within a plasraid. 

2-1.2 Seouence compilation. 

The amino- acid co-ordinates specified by the V-min model were used to 
compile the DNA sequences for the synthetic gene using existing DNA 
sequence data. The DNA/amino-acid sequence data for the Fv fragment 
of MCPC603 is published (Skerra et 1991, Bio/Technology 9, 

p273-278) • The inclusion of a c-myc peptide tag (Ward et al, 1989, 
NatTore 34^, p544-546) at the C-terminus is usefia for subsequent assay 
ptirposes. There are no antibodies available which would be capable of 
recognising the framework regions of a range of V-mins and any CDR 
specific antibodies would have restricted use and could interfere or 
be blocked in an assay. 

The verified sequence was then checked for the presence of codons 
rarely \ised in E . coli ( according to Andersaon and Kurland, 1990, 
Microbiol. Rev. 54, pl98-210) and the relevant changes introduced. 
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2.1.3 Introduction of unique restriction sites. 

To maximise the flexibility of the gene we attempted the introduction 
of unique restriction sites flanking each CDR region- This will allow 
the subsequent replacement/shuffling of the binding domains by a pair 
of con^limentary oligonucleotides in the form of a restriction 
fragment. The premise was that the V-min framework may be the basis 
on which to build subsequent molecules with altered/alternate binding 
specificities. This could be in the form of modified or completely 
different CDRs. Such changes could be tested by protein modelling 
prior to any experimentation. 

Using the MAP/SILENT programme in the University of Wisconsin GCG 
software package (Devereux et al,l5B4, Nuc. Acids Res. 12 , p387-395, 
hereinafter refered to as "Wisconsin") , the V-min sequence was 
searched for potential restriction sites which could be introduced by 
codon changes but without causing amino*acid substitutions. From the 
results of all potential restriction sites a choice of iinique sites 
flanking the CDR regions , which do not compromise the E.coli codon 
preference, was made. 

Codon usage was again derived from this sequence and the relevant 
balance between the various codons as quoted in codon usage tables 
adjusted without compromising any of the introduced restriction sites . 

2.1.4 Cloning etrateov. 

The phagemid vector pBluescript KS- (Short et al, 1988, Nuc. Acids Res. 
l£, p75B3 and available f rom Stratagene) was chosen as the initial 
cloning vector to provide a permanent source of the gene. The 
complete nucleotide sequence of pBluescript KS- is available in the 
GenBank Genetic Sequence Data Bank, Bilofsky et al, 1986, Nuc. Acids 
Res. 14/ pl-4, filename VecBase(3) :BlueKSm, accession number VB0077 
(Fig 22) . This vector has the advantages of high copy number for ease 
of plasmid purification, colour selection for recombinants and the 
ability to generate single- stranded DNA which- is useful for 
introducing subsequent manipulations via site-directed mutagenesis. 
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The maimer of insertion into this vector can.be achieved in 
non- expressing orientation. 

The strategy allows for insertion of a synthetic V-min gene fragment 
between the Kfial and BamHS sites of the vector. By including 
additional unique restriction sites on the synthetic fragment which 
flank the gene, high flexibility for subcloning from this source is 
possible. After cloning into pBluescript, the majority of restriction 
sites within the gene remain unique. 



Subcloning int o exnregsion vectors . 

The gene can be cloned into pici expression vectors (described below) 
on a Seal (blunt -end) to Xbal restriction fragment betiween the Konl 
(T4 polymerase treated to create a blunt end) and Xbal sites . The 
gene can be subsequently transf ered to vectors with different 
promoters by simple restriction fragment swap such as for example Tee 



The gene design also allows subcloning into any vector which makes use 
of an Ndel site to create the ATQ initiation codon. In this case a 
single N- terminal amino-acid addition (Ser) would be created. 

Cloning into E^cQli secretion vectors would be usefxa in that any 
sequence modifications could be readily assessed in a binding assay. 
This would allow rapid generation and screening of several constructs . 
N- terminal amino-acid substitutions would result from cloning into the 
lac promoter/pelB leader vectors used for antibody fragment expression 
vectors (Gussow et aJ,, 1989, Cold Spring Harbor Symp . Quant . Biol LTV, 
P265-272) . These substitutions have been modelled and are not 
considered to significantly alter the folding of the molecule . 
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Translation ini tiation considerations. 

A DNA sequence file was compiled for the V-min gene in trp and 
lambdap^ promoter vectors with two different ribosome binding site 
(BBS) sequences. In each case, the programme W0U3 in Wisconsin was 
• used to predict mRNA secondary structure over the first 100 bases of 
message. Whilst not entirely predictive, this programme can be 
employed to verify translation initiation problems and can alert the 
researcher to any potential problems with a given sequence. After 
running this progranmie minor changes were made to the 5' region of the 
gene to remove a potential stem- loop which reduced the availabilty of 
the AUG initiation codon. These changes did not compromise the 
previous design features. The new stem- loop predictions have a low 
free energy and allow good access to the initiation codon. 

^•^•^ Se«^""f'«rv stru cture Ponsiderai^ipmg 

The gene was divided into the 20 oligonucleotides (Seq.lD No. a - 20) 
Shown in Figure 20/Table 2 on an arbitrary basis. The Wisconsin 
programme STEMLOOP was then used to identify potential secondary 
structures within the gene. These secondary structures can cause 
problems in the correct annealing of complementary oligonucleotide 
pairs during gene synthesis. Any significant stem- loop structure 
occuring within a particular oligonucleotide sequence was destabilised 
by base changes which did not compromise the main design 
considerations . 



2*2 Prepa^a^ion of Q liaonuelgptidea 

The oligonucleotide sequences shown in Table 2 were prepared using 
known methods such as on an J^plied Biosystems 3 8 OA DNA synthesiser 
from 5 ' dimethoxytrityl base-protected 

nucleoside-2 - cyanoethyl -N, N-di - isopropyl - phosphoramidites and 
protected nucleosides linked to controlled-pore glass supports on a 
0.2 /imol scale, according to protocols supplied by Applied Biosystems 
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Inc. Altemacive apparatus includes Applied .^Biosystems 380B or 394 
DMA synthesisers using 5 ' dimethoxytrityl base -protected 

nucleoside-2-cyanoethyl-N,N-di-isopropylphosphoramidites and protected 
nucleosides linked to controlled-pore glass supports on a 0.2 ^mol 
scale, according to protocols supplied by Applied Biosystems Inc. 

Alternatively, the oligonucleotides can be prepared manually 
by the methods described by Atkinson and Smith in "Oligonucleotide 
synthesis, a manual approach- (M.T.Gait, editor, irl Press, oxford. 
Washington DC. pages 35-81) . 

The purification of the oligonucleotide sequences subsequent to 
automated synthesis was as follows: 

Each synthesised oligonucleotide, after cleavage from the solid 
support and removal of all the protecting groups, was dissolved in 
double distilled water (iml) . An aliqout (400m1) of each 
Oligonucleotide was taken and the DNA precipitated by the addition of 
SOMl of 3M sodium acetate. pHS.S and iml absolute ethanol. 
incubation on ice or at -20-C for a minimum of. is minutes, the 
precipitated DNA was pelleted by ceatrifugation at I3,000rpm for 10 
minutes. The supernatant from each pellet was discarded and the 
pellet washed in iml of 70% ethanol (in water) . re-centrifuging for 5 
monutes. Pellets were then dried under vacuum before resuspending in 
20^1 distilled water and 5^1 of a formamide/dye mixture (lOmM NaOH 
O.SmM EDTA, 0.01% each of bromophenol blue and xylene cyanol and 80% 
formamide) . The samples were then electrophoresed on a 10% 
polyacrylamide gel containing 8.3M urea in loOmM Tris-borate buffer 
2inM EDTA (pHe.3) . 
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After electrophoresis, the full-length oligojiucleotide product was 
excised from the gel after visualisation by UV-shadowing (Narang e£ 
al, 1979, in Methods in Eazymology, £8, pp90-98) . This was generally 
the major product of the synthesis although products of intermediate 
size were also visible. The DNA was electro -eluted from the 
acrylamide in a dialysis bag containing 400^1 of lOmM Tris-borate 
buffer, o.2mM EDTA (pH8.3) at 300v for 2 hours. The eluate was 
collected and the dna recovered by ethanol precipitation as described 
above. Alternatively, recovery of the DNA can be achieved by the use 
of proprietary kits such as Spin-X filters (Cos tar) with the protocols 
recommended. by the manufacturers. The concentration of each purified 
oligonucleotide solution was estimated by scanning a diluted sample 
from 300 to 220nm in a UV spetrophotometer and using an extinction 
coefficient of lOD^g =20/ig/ml. 

2.3. Gene Assembly 

Oligonucleotides Seq.ID 2-19 (100-400pm of each) were phosphorylated 
with T4 polynucleotide Icinase (50 units, Pharmacia Biosystems Ltd. ) in' 
a solution containing ATP (BfM) , g-^^P ATP (SOpM. New England Nuclear, 
SOOOCi/mmol) , spermidine (lOOAiM) , MgCl^ (20mM) , Tris-HCl, pH9.0 (50mM) 
and EDTA (O.lmM) at 37«c for Ihr. The effectiveness of the 
phosphorylation reaction was assessed by electrophoresing a 2^1 sample 
of each reaction on a 10% polyacrylamlde gel containing 8 . 3M urea in 
lOOmM Tris-borate buffer, 2mM EDTA (pH8.3) followed by autoradiography 
of the gel. The reactions containing oligonucleotides sag. id lo and 
20 were incubated at 70 -c for 10 minutes to inactivate the T4 
polynucleotide kinase thus preventing premature phosphorylation of the 
complementary partner upon mixing. 
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The phosphorylated oligonucleotides were then mixed in conple„,entary 
pairs as shown in table l to produce the duplexes A to j. The two s- 
Oligonucleotide secjuences (seq.ID 1 and IX, were used unphosphorylated 
(200pni m 25^1) to prevent concatomerisation of the final or 
intermediate gene products . The mixed oligonucleotide pairs were then 
ethanol precipitated as previously described. Each pair was 
resuspended in lOO^l of distilled water and the complementary 
sequences annealed by heating to lOO-C for 1 minute before gradually 
cooling to room temperature over several hours or overnight. 

The annealed pairs were then mixed into 3 sets as shown in Table 2 
The volume of sets II and III was increased to 400^1 by the addition 
Of 100.1 Of distilled water. Protein contamination was then removed 
from each set by treatment with Strataclean resin (Stratagene) in 
accordance with the manufacturer's instructions. Alternatively, this 
can be achieved by extraction against an equal volume of phenol The 
DIIA, was again recovered by ethanol precipitation and the pelleted DNA 
dried \mder vacuum. 

The DNA in each set was than ligated together by resuspension in 30.1 
Of a solution containing T4 DNA ligase ( i unit; Bethesda Research 
laboratories,, SOmM Tris-HCl. pH7.6. lOmM Mgci^, s% w/v polyethylene 
glycol 8000, i„M ATP and imM DTT (BRL, Focus. winter 198S, at l5-c 
for a minimum of i.s hours. The ligated DMA from each set was again 
recovered by ethanol precipitation, resuspended in 20.1 distilled 
water and 5.1 formamide dye mix and each set electrophoresed on a lo% 
polyacrylamide/8.3M urea gel. Bands of the appropriate length 
(1S3.217 bases, were identified by autoradiography, excised from the 
gel and recovered by electroelution and ethanol precipitation These 
products were then annealed, the sets pooled, the DNA recovered by 
ethanol precipitation and then ligated to generate the complete gene 
sequence in a repeat of the above processes. 
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Recovery of the intact gene sequence can be MchievBd by electrophresis 
of the ligation products on a 4* polyacrylamide/8 . 3M urea gel. 
autoradiography, excision of the appropriate bands, electroelution and 
ethanol precipitation. The recovered DMA then requires, annealing to 
produce the double -stranded gene fragment. Alternatively, a sample 
(l/il) can be added to a polymerase chain reaction (PCR) containing 
oligonucleotide seq.ID Nos. 1 and 11 1 0. 25 fM of each), dATP. dCTP, 
dGTP and dTTP (0.2mM of each). KCl (SOmM) , Tris-HCl, pH8.3 (lOmM) 
MgCl^ (l.SmM), gelatin (0.01% w/v) and Taq polymerase (2.5 units, 
Cetus Amplitaq) and incubating at, for example, 94 «C for l.S minutes, 
S5»C for 1 minute and 72 for l.S minutes for 25 cycles plus 72 'C for 
5 minutes in a programmable heat controller (Techne PHC-l) . The 
resulting single, an?)lified product can be purified from excess by 
spun column chromatography using a Miniprep spun column (Sephacryl 
S-400. Pharmacia Biosystems Ltd.) or other proprietary equipment in 
accordance with the manufacturer's instructions and recovered by 
ethanol precipitation. 

■ 

Prior to cloning the isolated gene must be phosphorylated (as 
described for the individual oligonucleotides) to allow ligation with 
a suitably digested cloning vector. 

2-4 Cloning the Svnf-hPH-i 

sed Gene 



1 



The synthetic gene was designed with cohesive termini which allow 
insertion into a general cloning vector digested with the restriction 
enzymes SjQl and BanjHl such as the phagemid pBlueacript (see "Cloning 
Strategy" above and fig 21) or any other suitable phagemid or plasmid 
vector. Cloning into the KesI site does not result in regeneration of 
a restriction site (GGTACA) whereas the BamHl site is retained. 
Additional restriction sites are included in the synthetic gene both 
5' and 3' to the actual coding sequence to allow flexibilty in 
siib-cloning for expression. 
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A sarnple of pBluescrip. ks- (s,g, was digested with the enzymes K^z 
(25 units, Bethesda Research Laboratories) and BamHI (24 units 
Amersham International) in a 100^1 solution containing lOmM 
Xris-acetate buffer. pH7.5, 10 magnesium acetate and SOmM potassium 
acetate (provided in lOx concentration as One-Phor-All buffer PLUS. 
Pharmacia Biosystems Limited) . The reaction v^s incubated at 37oc'for 
1.75 hours, control samples of the phagemid (O.SMg) were digested 
with each enzyme individually in a 10^1 volume under identical 
conditions. After digestion, seniles of each restriction digest (o a. 
- O.SMg) are electrophoresed on a 0.8% agarose gel in a solution 
containing 20mM Tris-acetate. pHB.O, o.SmM EDTA at lOOV for 30 minutes 
followed by staining with ethidium bromide (XMg/ml) and visualisation 
under UV illumination. Digestion with each enzyme in the double 
digest was confirmed by the presence of both major (2896bp, and minor 
(6Sbp) bands and by con5,lete linearisation with each enzyme 
individiial 1 y . 



The double digest was then treated with Strataclean resin (Stratagene) 
to remove proteins using the protocol provided by the supplier and the 
0NA recovered by ethanol precipitation. The purified DNA pellet was 
resuspended in 25^1 of distilled water and 5^1 of a solution 



containing 25% sucrose. SmM sodium acetate, o.l% sodium dodecyl 
sulphate and 0.5% bromophenol blue before electrophoresis on a 3% 
NuSieve GTG agarose gel (fmc Bioproducts) in 20mM Tris-acetate pH8 0 
and O.SmM EDTA at 80V for l hour. The DUA was visualised under uv ' 
xllumination after staining the gel in ethidium bromide solution 
(iMg/ml) . the major vector band excised from the gel and the DNA 
recovered by phenol, phenol/ chloroform and chloroform extractions b 



the procedures provided by the suppliers of the agarose. Alternative 

methods for isolation and ntir-i ^4 * 

ana purirication of DNA fragments are provided 

xn "Molecular Cloning - a laboratory manual". Maniatis, Pritsch and 
SambrooJc, published by CSH laboratory, second edition 1989 and 
hereinafter referred to as -Maniatis" (Chapter 6. p22-35) After 
recovery, the DNA was resuspended in 10^1 of distilled water 



9412G2SA2_I > 



wo 94/12625 PCT/GB93/02375 

- 43 - 



Alternatively, isolation of the synthetic gepe via PCR results in a 
product which can be cloned into a vector digested with a restriction 
enzyme which creates blunt ends such as Smal digested Ml3iiipio 
(Messing, j., xs83. Methods in Enzymol.lOl. p20) available from 
Amersham International . M13inpl0 can be purchased (from Amersham 
International) already digested with Smal and de-phosphorylated with 
alkaline phosphatase to prevent re-circularisation. Alternatively any 
suitable vector can be digested with a restriction enzyme generating 
blunt ends and then treated with calf -intestinal or bacterial alkaline 
phosphatase using protocols provided by the supplier of the 
appropriate enzyme. 

Cloning of the isolated synthetic gene was achieved by ligation with a 
san^jle (50-lOOng) of the KesI and BamHI digested pBluescript KS- 
vector in a solution containing T4 DMA ligase ( x unit; Bethesda 
Research Laboratories), 50mM Tris-HCl, pH7.6, lOroM Mgci , 5% w/v 
polyethylene glycol 8000, imM ATP and imM DTT which is incubated at 
.15»c for a minimum of 2 hours. Alternatively the PCR generated 
synthetic gene is ligated with the final cut and dephosphorylated 
M13rapl0 under similar conditions. After ligation, the DNA can be used 
to transform con5)etent E . coli cells such as strain DHSa (a Dlac 
derivative of strain DHl, Hanahan, D.. 1983. J.Mol.Biol. 166 
PS57-S80) . for PBluescript and strain JMiOS (Yanish- Perron et al, 1985, 
Gene pios-iis) for MlSmplo. The competent cells can be purchased 
from a number of siqjpliers such as Bethesda Research 
Stratagene. Alternatively, procedures for the generation and 
transformation of competent cells are provided in Maniatis, Chapter l 
P74-84 and Chapter 4 p33:38. The vectors used can transform E.coli 
cells to ampicillin resistance (lOOMg/ml) or in the case of Ml3n?>io to 
generate plaques due to growth inhibition of infected cells. 



RNA n»nA 4^ 
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Havxng selected ampicillin resistant: colonie? or plaques, these can be 
propagated and DNA purified for analysis. With both vectors, double 
stranded phagemid or replicative form (rf) dna can be recovered from 
cultured cell pellets. Alternatively single stranded DNA can be 
prepared by rescue of phage particles using helper phage (such as R408 
described by Russell e£ ^, XSSS , Gene ^. p333-338, from phagemid 
containing cells or directly from the supematants of cells infected 
with Ml3mplo. standard protocols for the purification of double- and 
sxngle-stranded dna are provided in Maniatis (chapter l, p2X-S2 and 
Chapter 4. p26-32) . Double-stranded DNA can be used for analysis by 
restriction enzyme digestion to identify clones with inserts of the 
appropriate size. Both types of DNA can be used for DNA sequence 
analysis by the dideoxy chain termination method of Sanger 
(Proc.Nat.Acad.Sci. USA 74. 1977. p5463) using a proprietary 
sequencing kit such as the Sequenase Jcit stalled by United States 
Bxochemical Company and used in accordance with the protocols provided 
and analysed by autoradiography after high resolution polyacrylamide 
gel electrophoresis (Sanger and Coulson, 1978. FEBS lett.^. pi07) . 

one plasmid clone identified as containing the gene ae™«r,^ 

= sene sequence was named 

PMIN603. The construction of this plasmid is outlined in fig 23. 



Constnigtion of ig.coli ^yp^oo 



sion vegi 



The V-min gene can be cloned into a number of expression 
vectors for intracellular accumulation. Such vectors can allow 
constitutive' expression from a trg promoter (Bennet and Yanof sky 
1978. J.Mol.Biol., I2i, P179.192) or regulable expression from a' 
lambda promoter under the control of a temperature sensitive CI857 
repressor gene (Remaut et al, isas. Methods in Enzymology 

P366-383) . suitable alternative promoters can also be employed for 
this purpose. 
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Variants of these vectors can be prepared in. which the only difference 
is in the DNA signal for translation initiation site or ribosome 
binding site (RBS) . These variants can be compared for optimal 
acciimulation level which can vary with different RBS sequences. 
2.5.1 PICI0020 

Plasmid vector pICI0020 is a pAT153 based plasmid vector in which the 
651bp EcoRl-AccI region is replaced by a I67bp EcoRl- cia l fragment 
consisting of : - 

1) a synthetic E.coli trp promoter and trp leader ribosome binding 
site 

2) a translation initiation codon 

3) a multiple restriction enzyme recognition seopience derived from 
M13mpl8 containing sites for Kpnl, BamHI, Xbal, Sail, PstI, SphI and 
Hindlll . 

4) a synthetic transcription termination sequence. 

The construction of a plasmid vector containing a synthetic trp 
promoter sequence is published (Windass et al Nuc, Acids Res. 10 
p6 639-6657, 1982). A promoter fragment was isolated from such a 
vector after digestion with the enzymes EcoRI and Hna l and 
purification of the appropriate band from an agarose gel by 
electro-elution (in Maniatis, Chapter 6 p28) . 

A pair of complementary synthetic oligonucleotides were prepared which 
would ligate to the Kg^Z end of the promoter fragment providing the 
natural trg leader ribosome binding site, a translation initiation 
codon and a 3' KanI cloning site. These oligonuleotides were mixed in 
equimolar concentrations and allowed to anneal by heating to 100 »C 
followed by slowly cooling to room temperature. 
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The promoter fragment and annealed oligonucl.eotides were then ligated 
and the appropriate band isolated from a polyacrylamide gel by 
electroelution. This fragment was then ligated with an M13mpl8 vector 
derivative containing the tre attenuator sequence (generated from 
synthetic oligonucleotides) cloned into the HisdIIl site and 
introducing an additional £1^1 restriction site 3' to the attenuator. 
The ligated DNA was transfected into E.coli strain Jm09 
(Yanisch- Perron et al Gene, 31, pi03, X985) made competent by the 
CaCl^ method (Maniatis, chapter lp82) . After plating out and 
incubation of the plates, plaques were screened by the method of 
Benton and Davies (Maniatis, chapter 4p41) using a "p labelled probe 
generated by .nick translation of the EcaRI-aBal promoter fragment 
isolated previously. Single, stranded ONA was prepared from positively 
hybridising plaques by a standard method (Maniatis. chapter 4p29) and 
sequenced using the M13 universal primer and the Sanger dideoxy chain 
termination method as provided in kit form by a number of 
eg. Sequenase (Uhited States Bioscience) . 

RF DNA was prepared from one isolate in which the promoter/ribosome 
binding site/attenuator sequence had been confirmed. This DNA was 
digested with EcaRl and £1^1 and the appropriate fragment isolated 
from a polyacrylamide gel as above. Plasmid pATlS3 (Twigg and 
Sherratt, isso. Nature 283, p216-218) was digested with the enzymes 
EcoRI and Assi and ligated with the isolated promoter fragment. 
Ligated .DNA was transformed into competent E.coli HBIOI (Bethesda 
Research Laboratories) and ampicillin resistant colonies selected. 

Plasmid DNA from several clones was prepared and DUA sequence derived 
from the region between the Ss^l and Sl^i sites. One clone confirmed 
as containing the correct promoter/attenuator region was named 
PICI0020. This construction is outlined in fig. 24. 
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2*^*^ Preparat ion of Plaamid prm m 

The plasmid pCHlOl corresponds to pici 0020 except that the 
EcoRI-Sall fragment is .replaced by a fragment consisting of the SEQ id 
NO 37 (see Figure 25) and the interferon ct^ gene sequence as described 
by Edge M.D. et al. Nucleic Acids Research 1983. Volil, p64l9-6435. 
in this regard the 3' -terminal ATG codon of SEQ ID Ko 37 immediately 
precedes the TGT codon which codes for cysteine (amino acid 1) in the 
interferon sequence of the above-mentioned Edge M.D. et Nucleic 
Acids Research reference. The 5' nucleotide sequence GATCCATG and the 
complementary 3/ nucleotide sequence GTAC are thus omitted from the 
nucleotide sequence of the aforementioned reference. 

2.5.3 PICI0042 



PICI0042 is a plasmid in which the antibiotic resistance markers of 
PAT153 have been replaced by a single, inducible tetracycline 
resistance gene from the plasmid RP4 (encoded by the gene tetA and 
regulated by the product of the tetR gene) . These genes have been 
characterised by Klock et al (J. Bacterid . 161 p326-332, 1985). A 
plasmid stability function (oer) has also been incorporated. This 
obviates the requirement for fi- lactam antibiotics in cultures of cells 
bearing this plasmid. Because the new resistance marker is only 
expressed in the presence of antibiotic, the tetA gene product will 
not be a potential contaminant in cultures where the plasmid is 
stabily maintained in the absence of tetracycline. 

The initial stage in the generation of this vector was to produce a 
derivative of .pATlS3 from which the gene encoding tetracycline 
resistance had been completely removed. A complementary pair of 
synthetic oligonucleotides were designed to replace the EooRl-Aval 
fragment from pAT153 with a short sequence containing several unique 
restriction endonuclease sites for subsequent cloning. ' 
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PAT1S3 plasmid DNA was digested with the enzymes EcoRi and Aval and 
the 2.l75Kbp plasmid DNA fragment isolated from a 0.7% agarose gel 
usxng Geneclean (Bio loi, California) in accordance with the 
manufacturers instructions. The l.425Kbp fragment containing the 
tetracycline resistance gene is thus removed. 

The oligonucleotides (fig 27/27A) were phosphorylated using T4 
polynucleotide kinase and equimolar amounts annealed together a 
sample of the annealed oligonucleotides was then ligated with the 
plasmid fragment from pATlS3 . Ligated DMA was transformed into 

competent E, coli HBlOl (BRi.) anri am«4«^n • 

^ ^ M-Luj. Katoa) and ampicillxn resistant colonies 

selected. 

several colonies were picked for small-scale plasmid preparation 
by the method of Bimboim and Doly (as specified in Maniatis , chapter 
XP25, and the desired construction identified by restriction analysis 
wxth suitable enzyme, eg. EcoRl, Av^I and MmHl. The structure of 3 
xsolates identified as having the correct restriction pattern was 
confirmed by dka sequence analysis using a- pBR322 E^oKl site clockwise 
primer (New England Biolabs) . One isolate was named piciOOlS. 

RP4 plasmid DNA was isolated from extant stocks by the method of 

Holmes and Quigley (Maniatis. chapter lp29, . This D«A was cut to 

completion with B^ill and then partially with Xmal (at 250c for up to 
35min) — ^ 



samples at various timepoints until a 2.45 Kbp fragment 
containing the tet£ and te£^ was clearly identifiable, a sample of 
pace nUA (Vanisch- Perron ^ ^, ,sss. Oene 3^, .pxo3-iis and available 
fro™ Amersham International, was digested to con^letion with SamHI and 
Snal . Ligations were performed to insert the tetracycline resistance 
genes into the pUC8. Ligated DNA was transformed into £^ C600 
(Appleyard, R.K. Genetics 23. P440, 1954) made competent by the CaCl 
method (Maniatis, chapter ipa2, and tetracycline resistant colonies' 
selected. Plasmid D«A was prepared from 8 clones (Holmes and Quigley, 

and th? presence of the RP4 tetR and A oenes coafir^^A v 

A genes confirmed by restriction 

analysis. One of these isolates was named pTB344. 
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The tetracycline resistance genes were then .inserted into pIClOOls 
(described above) by replacement of an EegRi/PstI fragment from 
PICI0019 with the corresponding fragment from pTB344. This results in 
replacement of the majority of the ampicillin resistance gene in 
PICI0019 with the tetracycline resistance genes. After digestion and 
ligation of the plasmid DNAs. followed by transformation into E.coli 
C600, colonies were selected on the basis of phenotype ie. Tc^ and 
Ap . Plasmid DMA was prepared from 4 such clones and digested with a 
combination of enzymes eg. BamHI/Pstl/SstI . EcoRI/ Sal l. sma l. 
SEi^l/Sall and Ajsl/Pstl . All 4 clones produced restriction patterns 
consistent with the desired construct. One of these was designated 
PTB351. 

Summers and Sherratt (Cell 36 P1097-1103, 1984) have shown that the 
instability of plasmids derived from ColEI (eg. pAT153) is due to the 
loss of a 2B3bp sequence, cer, present in the parent plasmid. This 
sequence helps prevent the formation of plasmid oligomers, the latter 
appearing to disrupt plasmid partitioning in some as yet undefined 
way. The cer sequence (Summers, D. et al MGG 201. p334-33a, 1985) was 
kindly provided by Prof . D . Sherratt in the form of a fragment cloned 
into pUCia (PKS492) . PKS492 plasmid DMA was digested with BamHI and 
Tagi to release a 289bp cer- containing fragment. Plasmid pTB351 DNA 
(isolated from the dam" host E.eoll GM48 - Arra j , J.A. and Marinus, 
M.G. J.Bact. 1S2 P562-565, 1983) was digested to completion with lainHI 
and Cial and ligated with the. digested pKS492 DNA. After 
transformation of ligated DMA into competent E.coli CSOO, tetracycline 
resistant colonies were selected. Restriction analysis of plasmid DNA 
from putative clones with the enzymes ^I, £11341 and Pvu i was used to 
confirm the presence of cer. One isolate with the correct structure 
was named pICI0042. This construction is outlined in figs. 26 and 
27/27A. 
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2.S.4 



Insertion of an Expression Casisette infep piCTOnA? 



An expression cassette consisting of the trg promoter, a ribosoms 
binding site and the interferon gene was isolated from plasmid 
PCHIOI (see 2.5.2 above) on an EeoRI to Sjs^ restriction fragment. 
This was ligated into the vector pIC10042 (see above) similarly cut 
with EcoRI and SEhl- This DNA was used to transform a con5>etent 
culture of E. coli CSOO and tetracycline resiscant colonies were 
isolated. A few of these were tested by plasmid DNA preparation and 
restriction analysis for the acquisition of the Sstl restriction site 
carried on -the expression cassette. Clones positive in this respect 
were further tested by restriction mapping to check that the expected 
construct was correct. They were also checked for the conferred 
capacity to produce interferon protein as. analysed on a 



polyacrylamide-SDS gel stained with 
confirmed clone was designated pLBOOS. 



blue. One such 



2.5.5 



Insertion of T4 transcription termina»-r^r- -ir^^-^ 



(244 



The T4 transcription terminator sequence in the form of the Sail to 
HiaOllI fragment (see SEQ ID No. 38 and Figure 28) was inserted into 
the multicloning site of an intermediate vector pTB244 (described in 
European Patent Publication No. 237,269) between its S&2^ and Hindlll 
sites. Clone analysis was used to confirm the structure of this 
construct (pTB244-T4 ter) . Prom this vector, an flafei to Sfilil fragment 
containing most of the multicloning site and the T4 termiBa^n^ = 
then isolated. This was inserted into pLBOOS similarly cut with Ss£l 
and SEfel thereby substituting the interferon gene but leaving a 
cassette consisting of the ££e prranoter, multicloning site and T4 
terminator. This construct was confirmed by clone analysis and the 
plasmid designated pIiB013 (Figure 29) . 
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^•S.S Substitution of the multiclonina site 

The multicloning site in pLBOlS is not ideal for this vector in 
several respects: the Sail. BamHl and Smal sites are not unique, 
existing elsewhere on the plasmid. This fragment was therefore 
excised by cutting with SstI and Xbal (both unique) and synthetic 
oligonucleotides with the sequence of SEQ ID No. 40:- 

Ndel Ball I Seal 

5' AGCT CCATATGG TACC AGATCT CTCGA GAGTACTT 

GGTAT ACCATGG TCTAG AGAGCTCT CATGAAGATg 5' 
KESI Xho l 

were inserted in its place. Clones were analysed for acquisition of 
the new restriction sites and then confirmed by sequencing. One such 
plasmid was designated pLB014. The new cloning sites inserted in 
way are: Ndel, Kpnl, Hal II. Xhol and Seal with the previoiis Xbal and 
Sail following them. 

2.5.6,1 Further modification 

It was discovered that the adjacent Ssti and Ndel sites in pLB0l4 
coiild not be cut by both these' restriction enzymes either 
simultaneously or sequentially presumably because of their close 
proximity. An additional sequence was therefore inserted between 
them. This was done by cutting piaoi4 with SstI and Knnl and then 
inserting the synthetic oligonucleotide of SEQ ID No. 41:- 

5' AGCTCAGCTGCAGCATATGGTAC 
GTCGACGTCGTATAC 5' 
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Clones were analysed for acquisition of an extra Pvull or PstI site 
and then confirmed by sequencing. One such plasmid was designated 
pLBOlS (- PICI0080). This plasmid, unlike pLB014. is efficiently cut 
by S££i and K^l or Ndel. This is to provide a place to insert a 
variety of ribosome binding site sequences correctly positioned with 
respect to the upstream trs promoter and by digesting with Knn l and T4 
DNA polymerase treatment to generate a blunt end or by digesting with 
Jidsi, designed to provide the ATG start codon of the gene to be 



2.S.7 Altern ative promoters 

The trs promoter in pIClOOSO is excisable on an EcoR I to Sae l 
restriction enzyme fragment and can be replaced by a suitable 
alternative promoter element such as the regulable lambda P /cI857 
element from plasmid plCI1079 (Figure 31) which has been deposited 
under the Budapest Treaty at the National Collections of Industrial 
and Marine Bacteria Limited (NCIMB) , 23 St. Machar Drive, Aberdeen, 
AB2 IRY. Scotland, U.K. (NCIMB No. 40370, date of deposit X9 February 
1991) . A version with lambdaP^ promoter is called pICI123 (Fig 30) . 

Other E.coli promoter elements available on a compatible restriction 
fragment could also be substituted. 

^•^ Sub-clonino the S^mthetie V- min Gen^. -. nto an «.vT^>-o=.,-i 

vector 

The cloned synthetic gene in pMIll603 contains a number of unique 
restriction sites which can be used for siib-cloning the V-min coding 
sequence into an E.eeli (or yeast) expression system. The enzyme 
digests at the S' end of the gene to generate a blunt -ended fragment 
which starts at the N-terminal amino-acid codon (ACT=thr) . This can 
be fused to an expression plasmid which provides transcription and 
translation signals (promoter and ribosome binding site) and can be 
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blunt -ended immediately after a translation initiation codon 
(ATGomet) . Examples of such vectors are the pICI expression vectors 
described above. Alternatively, the unique Nde l site can be used to 
create a cohesive 5' terminus which can be cloned into an identical 
site of an expression vector such as pICIOOSO described above. In 
this case the translation initiation codon is provided on the 
synthetic gene fragment and expression results in an N- terminal 
extension (AGTsser) on the V-min protein. 

E . coli expression vectors which allow fusion to a secretory leader 
sequence may also be used such that the expressed V-min protein is 
directed to the E.coli periplasm ( as described by Skerra et al, 1991, 
Bio/Technology 9, p273-278) . In addition, secreted protein can be 
fused to the C-terminal portion of a bacteriophage coat protein (as 
described by Hoogenboom al, 1991, Nuc. Acids Res. 19 . p4133-4l37) 
to allow the generation of phage particles which display the V-min 
protein in active form on their surface as an aid to selection. These 
approaches may require the modification of 5' and 3' restriction 
enzyme sites by, for exan^le, PGR as described in the cited examples. 
A r,eview of the expression of antibody fragment molecules in E . coli 
has been published by Pluekthun, 1991, Bio /Technology 9^, p545-55l. 

The synthetic gene contains unique restriction sites for SnaBI, Bal ll . 
BsaBI and BamH I after the translation termination codon (TAA) to allow 
flexibility for cloning the 3' terminus into a range of expression 
vectors. In addititon, pMXN603 has additional cloning sites for Xbal, 
Not I, BstXI, Sad and BssH II 3' to the gene to further improve 
flexibility. 
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Expression vector picii23 was digested with (Bethesda Research 

Laboratories) in a solution containing 20mM Tris-HCl, pH7.4. SmM MgCl 
. and 50mM KCl at 37oc for 2 hours. The KebI digested DNA was then ' 
treated with T4 DNA polymerase (l unit/^g DNA Amersham International) 
in the above solution containing O.lmM each of dATP, dCTp! dGTP and 
dTTP at 12 0c for 15 minutes. This treatment makes use of the 3' to 5' 
exonuclease activity of the T4 polymerase to remove the 4 base 
3 'extension generated by KssI digestion resulting in a blunt end 
innnediately after the translation initiation codon. The reaction is 
then treated with Strataclean resin (Stratagene) to remove protein and 
the DNA recovered by ethanol precipitation. The DNA was then digested 
with Xbai (Bethesda Research Laboratories) in a solution containing 
SmM Tris-HCl, pHS.O, 6mM MgCl^, ISO mM Nad and ImM dithiothreitol 
(DTT) at 370C for 2 hours followed by treatment with Strataclean resin 
to destroy enzyme activity and recovery of the DNA by ethanol 
precipitation. The treated DNA was resuspended in 10^1 of distilled 



The plasmid pMlN603 was digested with Seal (Pharmacia Biosystems 
Limited) in a solution containing 2 OmM Tris-aoetate, pH7.5, 20mM 
magnesium acetate and lOOmM potassium acetate at 37»C for 2 hours. 
The reaction was then treated with Stratclean resin and the DNA 
recovered by ethanol precipitation. Th^ purified, Ssal digested DNA 
was subsequently digested with Xfeal as described for pICI123 above . 
The protein is removed by Strataclean resin treatment and the DNA 
recovered by ethanol precipitation. The digested DNA was then 
eletrophoresed on a 3% NuSieve GTG agarose geX and a DNA band 
corresponding to the McPC603 V-min fragment {523bp) isolated from the 
gel as described previously for the isolation of ^l/BamHI digested 
pBluescript KS- . The purified DNA fragment was resuspended in xo^l of 
distilled water. 
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A sample of the K2nl/T4 polymerase treated/ Xba l digested pICIl23 
(approx. 50ng) was ligated with the Sca l/ Xbal digested V-min gene 
fragment {50-lOOng) as described previously xonder "Cloning the 
Synthesised Gene". Ligated DNA was used to transform competent E,coli 
strain CGSC 6300 to tetracycline resistance. Plasmid DNA was prepared 
from selected tetracyline resistant colonies according to the methods 
in Maniatis previously refered to. Restriction digestion analysis is 
used to identify plasmids which contained a cloned insert consistant 
with the size of the McPC603 V-min gene. The cloning of the gene was 
then confirmed by DNA sequence analysis. 

One cloned plasmid with the correctly inserted V-min gene sequence is 
named pICI1572. The construction of this plasmid is outlined in 
fig. 32. 

Alternatively, similar techniques can be used to clone the V-min gene 
into other vectors for intracellular accumulation, secretion to the 
periplasm or for display on the surface of a bacteriophage. 

2.7 Expression studies 

Plasmid pICI1572 is used in expression studies. It is used to 
transform a range of E.coli host strains, some of which are protease 
deficient, to cissess accumulation levels in different genetic 
backgrounds . 

Cultures {75ml) are grown in MS medium (I2.8g/1 di- sodium hydrogen 

orthophosphate septahydrate, 3g/l potassium di-hydrogen 

orthophosphate , 0.5g/l sodium chloride, l.Og/1 ammonium chloride) plus 

0.2% (w/v) glucose and 0.02% (w/v) casein hydrolysate, in the presence 

of tetracycline (15 /ig/ml) , shaking at 37*>C for 7-16 hours. 

Expression is induced by increasing the growth temperatxire to 42«»C for 

2 hours prior to harvesting the cells. The OD^ of the culture is 

540 

measured in a spectrophotometer, the cells pelleted by centrifugation 
at lOOOx g for 5 minutes auid the cell pellet is resuspended in a 
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solution containing 62.5inM Tris-HCl, pHS.8, .1% (w/v) sodium dodecyl 
sulphate, 10% sucrose, SOmM DTT and 0.05% (w/v) bromopiienol blue to 
°°540'^° incubated at lOO-C for is minutes. A 20ai1 aliquot of 

each sample is then used for analysis by sodium dodecyl 
sulphate -polyacrylamide gel electrophoresis (SDS-PAGB) analysis as 
described in Maniatis (Chapter X8, p47-5S) . 

Expression of the MCPC603 V-min gene is detected by the presence of a 
additional band in cells containing clone plCIlS72 compared to cells 
containing the parent vector (pICI123) on Coomaasie-blue stained gels 
Alternatively, the incorporation of ^^S methionine into newly 
synthesised proteins can be used to identify additional proteins 
expressed by piciis72 clones compared to non-expressing control 
clones . The identity of additional proteins can be confirmed by 
electrophoretic transfer of proteins from SDS-PAGE to a nylon or 
nitrocellulose membrane using a Pharmacia Multiphor electro-blotting 
apparatus in accordance with the manufacturer's instructions followed 
by Western blot analysis using an antirc-myc tag antibody (9E10 as 
described below) and detection by standard procedures (Maniatis, 
Chapter 18, p69-75) or Auroprobe Bin- (Amersham International) using 
the protocols provided with these reagents. 

2.8 

Recombinant V-min expressed in E. coli is produced in 
either soluble form (particularly by manipulation of expression 
construct and growth conditions) or as insoluble inclusion bodies 
Secretion methodology can also be used (see Example 4) . 




BNSOOCID: <WO. 



^12e2SA2J_> 



BNS oaoe sa 



wo 94/12625 PCT/GB93/02375 

- 57 - 



Purification of soluble forms- of the protein, is accomplished for 
example by cell lysis (sonication or high pressure homogenisation) , 
centrifugation and then separation techniques known in the art in a 
variety of combinations. Known separation techniques include the 
following: 

ion-exchange chromatography; gel permeation chromatography; ammonium 
sulphate precipitation; acid precipitation and; 

immobilised antigen affinity chromatography (for example, for McPce03 
Vmin see reference : Glockshuber, R. et al. 1990, Biochemistry 29, 
1362-1367, and references contained therein) . 

Renaturation and purification of insoluble V-min is accomplished 
by cell lysis (sonication or high pressure homogenisation) , 
centrifugation, washing of the pellet fraction with buffer or mild 
chaotropes, solubilisation in chaotropic agents (eg. 6 M guanidine 
HCl, 8 M urea, detergents etc with or without reductant being present) 
followed by the controlled removal of the chaotropic agent (eg. by 
dilution or dialysis) and formation of the disulphxde bonds by 
oxidation reactions (eg, using dissolved air, glutathione redox 
buffers, etc) . The resulting solubilised, renatured V-min is purified 



* 

For example. E, goli paste, containing McPC603 Vmin inclusion bodies, 
was resuspended in lysis buffer (IS * sucrose, 50 mM 

(ethylenediaminetetraacetic acid, hereafter abbreviated to EDTA, 50 mM 
Tris (hydroxymethyl) aminomethane hydrochloride, pHS.O) to an optical 
density of 50 (550 nm wavelength) . Lysozyme was added to a final 
concentration of i mg/ml and the suspension was left on ice for 30 
minutes. The sample, on ice, was sonicated in 4 x 45 second bursts and 
the resulting lysed cell suspension centrifuged at 2S000 g for 30 
minutes. The pellet, containing the Vtain inclusion bodies, was 
resuspended in phosphate buffered saline and recentrifuged at 25000 g 
for 30 minutes. 
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The washed inclusion bodies were dissolved in a urea solution (8 M 
urea/50 nM potassium phosphate/50 mM beta mercaptoethanol . pH 7.4) and 
the resulting solution clarified by centrif ugation . The clarified 
solution was dialysed against 50 mM Tris (hydroxymethyl) aminomethane 
hydrochloride, pH 10.5 overnight at 4oC. The resulting solution was 
clarified by centrif ugation and the supemate, containing the 
solubilised Vtain, further dialysed against phosphate buffered saline 
and then purified utilising methods outline above. 



Aeti.V3.tv aaaaya 

General details of activity assays are given in 3.2 below, m the 
assay for McPCS03 v-min with c-myc tag, binding material suitable for 
the MCPC603 antibody is used in place of the Colo205 cells used for 
19.9 V-min. An exan5»le of a binding material suitable for McPC603 
V-min assay is a phosphorylcholine-protein conjugate (17) . m this 
case steps 1, 2 and 3 of the 19.9 v-min assay are replaced by:- 
1. lOOMl of a phosphorylcholine-protein conjugate solution 

(typically lOMg/ml but depending on the particular conjugate) in O.OSM 
carbonate/bicarbonate buffer pH9.6 is added to each well and incubated 
at 4<>c for 16 hours in a Oamp atmosphere. 



assay procedure, provided appropriate tyrosine residues 
are maintained within the Vmin structure, utilises the change in 
fluorescence observed on the binding of the phosphocholine hapten to 
the Vmin as described for McPC603 antibody fragments in reference : 
Glockshuber et al, 1990, Biochemistry 29, 1362-1367 and references 
contained therein. 

An alternative rapid qualitative assay for McPC603 binding to 
phosphocholine is provided by the selective binding of Vrain protein to 
immobilised phosphocholine matrices as described in reference : 
Glockshuber et al, 1990, Biochemistry 29, 13S2-13S7. 
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All or part of the Vmin gene sequence may be generated by the use of 
suitable primers and DWA sequence amplification using Polymerase Chain 
Reaction. Two primers are required for each part of the Vmin gene 
generated by PGR. Other regions of the gene can be generated by 
chemical synthesis, as detailed elsewhere in this document, by 
restriction enzyme isolation of suitable gene fragments, or by other 
standard molecular biology techniques, as deemed most appropriate for 
the particular Vmin gene sequence under consideration. The individual 
fragments of the Vmin gene may then be joined together by further 
rounds of PGR reactions, or by other standard molecular biology 
technic[ues. 

For example, in a typical Vmin construct consisting of two sequences 
derived from the Vh region of the heavy immunoglobulin chain of a 
specific antibody, joined to a sequence derived from the VI of the 
same antibody, all or any of the three Vmin regions may be derived by 
PGR, The specific sequence of the Vmin having been determined by 
molecular modelling, as detailed elsewhere in this document, and the 
Vh and VI nucleic acid sequence of the original antibody being 
available, in a molecular biology vector suitable for PGR - 
amplification. In the case, where the whole Vmin is to be generated by 
PGR, then primers are designed for the amplification reactions. For a 
three region Vmin construct six primers are required, two for each 
domain. In addition other primers may be required if it is neccessary 
to alter the native DMA sequence of the original antibody in the Vmin 
regions, in order to change specific codons as dictated by the peptide 
sequence of the modelled Vmin, or to change the codon usage to be 
compatible with the final Vmin expression system. Other functions or 
DNA sequence may be added to the Vmin sequence, for example 
restriction sites or a peptide leader or tag, either diiring the 
construction of the gene or during or by cloning into a succession of " 
molecular biology vectors for maintenance, sequencing or expression of 
the cloned nucleic acid sequence. 
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Primer design for the PGR of the Vmin regions is of particular 
importance. Primers are made to be complementary with the DNA 
sequence on either side of junction of the domain regions of the Vmin. 
Primers complementary to the ends of the Vmin may be extended if 
necessary to give features to aid the cloning of the Vmin sequence 
into the chosen molecular biology vector. Thus a Vh sequence having 
four DNA domains, A-BC-DE-FG-H, and a VI having three domains, 
I-JK-LM-N, which give a Vmin consisting of the three domains 
E-PC-DK-L, in that order, where A-B for example is a 5' to 3 ' DNA 
sequence from the 5' end of the Vh, and A and B are sequences of about 
20 nucleotides from each end of A-B. Thus the three pairs of primers 
to generate the three domains of the Vmin, with suitable overlaps, are 



^' ^ (with 5' addition for cloning if required) and the 

con^lement of FC, to generate E-FC by PCR from the Vh, 

2- FC and the con^lement of DK, to generate FC-DK by PCR from 

the Vh, 

3. . DK and the complement of L (with 5' addition for cloning if 

required) , to generate DK-L by PCR from VI. 

The three PCR fragments are then pxirif ied by standard Molecular 
Biology techniques, and may then be used to generate the Vmin, or 
intermediates by further rounds of PCR, 

For exan^le 

1. E-FC mixed with equimolar FC-DK, and primers E and 

complement of DK, in PCR will generate intermediate E-PC-DK. This 
when purified can be mixed with equimolar DK-L and primers E and 
complement of L, in PCR to generate the con?)lete Vmin, E-FC-DK-L. 
After purification this may be cloned into a suitable vector. 
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2. It is possible that the Vim may be generated by mixing E-FC, 

FC-DK and DK-L together in equimolar amounts, with primers E and 
complement of L in a PGR. 

3 • It is also possible that the Vmin may be generated by mixing 

all of the initial components together in a PGR. That is, Vh and VI 
approximately equimolar, with primers E, FC, DK and complements of 
DK and FC. 



It may be necessary to 
reactions and to vary 
find the best reaction 
set of primers. 



several combinations of the possible PGR 
conditions and component concentrations to 

for any specific Vmin generation and 



Any of the PGR generated Vmin fragments may be substituted by 
chemically synthesised double stranded DNA or restriction fragments or 
DNA fragments obtained by other molecular biology techniques, in order 
to generate the specific Vmin by PGR or other suitable molecular 
biology technique. 



3.1 Design Synthes is and Cloning of a Svnthetic V-min Gene for 

Antibody 19.9 

The amino-acid sequence for a V-min protein modelled for the antibody 
19.9 is shown in fig. 34, We have previously determined the complete 
DNA sequence of 19.9 heavy and light chain cDNAs (Tonge, D.W. et al, 
(1993), The Year in Immunology, vol 7, eds. Terhorst, C., Malavasi, F. 
and Albertini, A., published by Karger AG, Basel, Switzerland: 
Proceedings of Biotech RIA 92 - New Generation of Monoclonal 
Antibodies in Diagnosis and Therapy, Genoa, Italy, April 12-15, 
1992, published by Karger AG (Basel, Switzerland)) and used this 
informatiion to compile a DNA sequence capable of encoding the 
amino-acid seq[uence shown. 
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A synthetic V-min gene for antibody 19 . 9 was .designed according to the 
. principles described for the McPC603 V-min gene. The cloning regions 
and tag sequence which flank the coding sequence were identical to 
those used for the McPC603 synthetic V-min gene to maintain standard 
cloning strategies. The oligonucleotide sequences (ID No. 21-36) used 
to construct the gene are shown in fig. 33. 

Gene synthesis, cloning steps and expression studies were performed 
according to the protocols stated for the McPC603 V-min gene. 

3*2 Activity aaaayg . 

^•2.1 V-min detection anri aaaay 

v-min is detected by antibody reaction to a specific epitope, on the 
V-min 2gr as, or on a peptide tag added to the V-min during the 
cloning procedures . Detection is in the form of immunoassay, by 
Western blot, dot blot, Elisa or other antibody based detection 
system. An estimate of v-min concentration may then be made by 
comparison to standard material detected by the same system. 



For example, V-min is detected in bacterial supernatant by Western 
blot. Following separation of the protein oon^joaents by polyacrylamide 
gel electrophoresis, and electrophoretic transfer to a suitable 
membrane (for example nitro-cellulose) , the presence of V-min is 
detected by first blocking further non-specific binding to the 
membrane with a suitable protein solution (for exan5)le casein) , and 
then reaction with a V-min specific antibody (for exa.^ie a monoclonal 
antibody specific for a peptide tag on the V-min) . The presence and 
location of the V-min is identified by a label on the v-min specific 
antibody (for example an enzyme reaction with a suitable chromophore) 
or by a sandwich detection system employing a labelled antibody or 
other bi n d in g molectile reactive with the first V-min 
antibody. 
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A similar con5>etition system is used to estimate V-min concentration. 
V-min is iouad to microtitre plates, excess binding sites blocked with 
a suitable protein, and the binding of a V-min specific antibody 
(either epitope or tag reactive) to the bound V-min competed by the 
unknown concentration V-min solution (for example bacterial extract) , 
at suitable dilutions. Standard solutions of V-min are also used to' 
generate assay curves . .A reduction in antibody binding is detected by 
a label on the v-min specific antibody {for example an enzyme reaction 
with a suitable chromophore) or by a sandwich detection system using a 
labeUed antibody or other binding molecule reactive with the first 
V-min specific antibody. Examples of the methods may be found in the 
laboratory manuals of $ambrook et al (14) or Harlow and Lane (is) . 

^•^•^ Idantifieation and »^ of v-ml« binding <•» 

The binding of V-min to the antigen of the antibody from which the 
v-min CDR sequences were derived, or binding to a suitable derivative 
or substitute for that antigen (for example a peptide sequence 
identified from a protein antigen, or a glyco- lipid identified from a 
cellular antigen, or a mimotope) is determined by immunoassay (14.15) 
or direct physical methods (for- example using the Pharmacia Biacore 
apparatus or MePC603 Vmin fluorescence) . 

For example 19.9 v-rain with a c-myc C-terminal peptide tag sequence 
(EQKLISBEDL) is estimated in its binding to antigen in the form of the 
colo-rectal carcinoma cell line, C0I02OS (ATCC CCL 222) , by reaction 
with the monoclonal antibody 9E10 (16) . In a typical assay :- 
^' ^° C0I02OS cells (grown in serum free medium) in 100/il of 

PBS are added to each well of a microtitre plate and left at room 
tenperature for lo minutes. 

^' ""^^ plates are then centrifuged at about 400g for S m inutes 

at ICC, and left to equilibrate at room temperature for 30 minutes. 
3. lOO/il of 0.1% glutaraldehyde is added to each well to fix 

the cells, and the plates left at room temperature for 10 minutes. 
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^- "^^^ plates are washed 3 times with PBS, leaving the wells 

full for five minutes for each wash. The fiqal wash is emptied and 
the plates banged dry on paper towelling. 

5 . Excess binding sites in each well is blocked by the addition 

of 200M1 of a 1* BSA solution in PBS and incubation for one hour at 
room temperature. 

^' plates are washed 3 times with PBS, leaving the wells 

full for five minutes for each wash. The final wash is emptied and 
the plates banged dry on paper towelling. 

7. 100>1 of 19.9 v-min solution, of unknown concentration or 

standards, at a suitable range of dilutions in PBS is added to the 

wells and incubated in a dan^ atmosphere at 4-C for typically X6 
hours . 

8- The plates are washed 3 times with PBS, leaving the wells 
full for five minutes for each wash. The final wash is emptied and 
the plates banged dry on paper towelling. 

9- IOOmI of 9E10 hybridoma st^eraatant at a suitable dilution 
in PBS, typically 1/30, is added to each well and incubated for 2 
hours at 4"C. 

10. The plates are washed 3 times with PBS to which 0.05% Tween 
20 may be added, leaving the wells full for five minutes for each 
wash. The final wash is en^tied and the plates banged dry on paper 
towelling. 

11. lOO^il of an anti -mouse IgG labelled with horse-radish 
peroxidase, at a suitable dilution (for example Sigma Chemicals Co. 
product A9044 at a dilution of 1/2000) is added to each well and 
incubated at 4»C for 2 hours. This second antibody is tested for its 
non- reactivity with V-min. 

12. The plates are washed 3 times with PBS to which o.05% Tween 
20 may be added, leaving the wells full for five minutes for each 
wash. The final wash is emptied and the plates banged dry on paper 
towelling. 
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13 • To generate a suitable chromophore , the contents of one 

capsule of phosphate -citrate buffer with sodium perborate (Sigma 
p4 922) are dissolved in lOOiril of water according to the manufacturers 
instructions and 30mg of o-phenylenediamine dihydrochloride (OPD) 
added and dissolved. 100/il of this solution are added to each well, 
and incubated in the -dark for up to 30 minutes, until the colour 
develops . 

14. 50^1 of 25% sulphuric acid is added to each well to stop the 
reaction, and the absorbance of each well is read at 490nm. 

15. Stcuidard curves are then prepared, and the binding 
concentrations of unknowns is estimated. 

* 

Similar methods may be foimd in the laboratory manuals of Sambrook et 
al (14) or Harlow and Lane (15) . 

g^gAMPIiE 4 Generat ion of Dl . 3 V-min (conservative) using Prp 

A Vmin peptide sequence for the anti-lysozyme antibody Dl.3 (Amit, 
A.G., Mariuzza, R,A. , Phillips, S-E.V. & Poljak R.J. Science 233, 
747-754 (1986)), was generated by molecular modelling, based upon 
)cnown DNA and peptide sequence generated from the plasmid 
PSWX-VHD1.3-VKD1.3-TAG1 (Ward, S-E., Gussow, D., Griffiths A.D.,' 
Jones, P.T. & Winter G. Nature 341, 544-546 (1989)) by standard 
molecular biology DNA sequencing techniques. The peptide sequence of 
the Vmin is shown in sequence 45, and the Vmin consists of three 
domains from the D1.3 antibody, amino acids 65 to 115 from the D1.3 
heavy chain (1 to 51 in sequence 45) , amino acids 13 to 64 from the 
D1.3 heavy chain (52 to 103 in sequence 45) and amino acids 1 to 108 
from the 01.3 light chain (104 to 211 in sequence 45).. 
The sequence of the D1.3 insert in pSWl-VHDl .3-VKDl .3-TAGl as 
determined from the start of the first pelB through to approximately 
50 nucleotide bases after the TAGl sequence is shown in Figure 35. It 
can be seen that the sequence after the TAGl differs slightly from the 
published sequence (Ward, S.E., Gussow, D. , Griffiths A.D., Jones, 
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P.T. & Winter G. Nature 341, 544-546 (1989) ) .between the EcoRl 
restriction site and end of the TAGl. It is the sequence as shown in 
Figure 3 5 that was iised for the choice and sequence of the PGR 
primers. The specific choice of primers for Vmin gene construction 
will depend on the particular sequences to be included in the final 
construct . 

The sequences of the chosen primers are shovm in sequences 46 to 51. 
All primers are chosen to have about 20 bases complementary to the DNA 
strand on which they are priming at their 3' end, and if possible to 
end in a GC rich sequence. The single stranded DNA primers are used 
in pairs in three PGR reactions with pS Wl-VHDl. 3- VKDl .3 -TAGl double 
stranded DNA 

^" Primer Sequence 46 with primer sequence 49. 

Sequence 46 primer bases 22 to 39 are the same as bases 259 to 276 in 
Figure 35, except that the codon for Arg (262-264) is changed from AGA 
to CGT because of E.coli preferred codon usage (Andersson, S.G.E. & 
Kxirland C.G. Microbological Reviews 54, 198-210 (1990)). Bases i to 
21 of sequence 46 code for the seven amino acids Asp Tyr Pro Gly Ser 
Ser Gly and have a Smal restriction site (GGGCCC) at bases 7 to 12. 
The Smal site was included to allow blunt end cloning of the Vmin into 
an E.coli plasmid secretion vector. Additional codons were added and 
modelled at the start of the Vmin gene, because of the Arg at amino 
acid 2 in the Vmin sequence 45. It is known that charged amino acids 
at the start of a gene to be secreted into E.coli periplasm can block 
generation of the protein product (Geller B., Zhu H-Y. , Cheng s., Kuhn 
A. & Dalbey R.E. Journal of Biological Chemistry, 268, 9442-9447 
(1993)) , and the Gly Ser Ser Gly leader was designed to overcome this. 
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Sequence primer 49 bases 25 to 45 are complementary to bases 39i to 
411 in Figure 35, except that the codon for Val (400-402) is change to 
Leu to fit with the molecular model sequence of D1.3 Vmin (sequence 
45) . Bases 1 to 24 are complementary to bases 103 to 126 in Figure 



35. 



2- Primer sequence 47 with primer sequence 50. 

Sequence 47 primer bases 22 to 45 are the same as bases 103 to 126 in 
Figure 35. Primer bases 1 to 21 are the same as bases 391 to 4ii in 
Figure 35, except that the codon for Val (400-402) is change to Leu to 
fit with the molecular model secjuence of Dl.3 Vmin (sequence 45) . 

Sequence 50 primer bases 24 to 46 are con^lementary to bases 236 to 
258 in Figure 35. Primer bases 1 to 23 are complementary to bases 538 
to 560 in Figure 35 except that the codona for Glu and Leu (544-449) 
are changed to Gin and Met to fit with the molecular modelling of Dl.3 
Vmin (Sequence 45) . 

^' Primer sequence 46 with primer sequence 51. 

Sequence 48 primer bases 24 to 46 are the same as bases 538 to 560 in 
Figure 35 except that the codons for Glu and Leu (544-449) are changed 
to Gin and Met to fit with the molecular modelling of Dl.3 Vmin 
(Sequence 45) . Primer bases 1 to 23 are the same as bases 236 to 258 
in Figure 35. 

Sequence 51 primer bases 1 to 24 are complementary to bases 922 to 945 
in Figure 35. 
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The PCR reactions were performed using a thermocycler of 25 cycles of 
94 C for 1.5 minutes, S5°C for 1.5 minutes and 72°C for 2 minutes 
followed by a single period of 9.9 minutes at 72°C. Reactions were 20 
mxcrolitre total volume containing a concentration of i microMolar of 
the primers, o.lng of the pSWl-VHD1.3-VKD1.3-TAGl double stranded 
plasmxd DNA, dNTPs each at a concetration of 200 microMolar and 
amplification buffer and Taq DNA polymerase as recommended in ■ 
•Molecular Cloning - a laboratory manual, SambrooJc, j., Pritsch E F 
. Maniatis T. Published by Cold Spring Harbor I^oratory Press, second 
Edition 1989' or the material supplier's data sheets. PCR conditions 
may need to be optimised as recommended in the book. 

The structure of the Dl.3 Vmin. the D1.3 insert in 

PSW1-VHD1.3-VKD1.3.TAG1 double stranded DNA, and the primers for these 
PCR reactions are summarised diagramatically in Figure 36 The 
products Of the PCR reactions are summarised diagramatically in Figure 
37. All numbering in these diagrams is based upon the amino-acid 
sequence of the pi.3 antibody. 

The major product of each PCR reaction, with the correct molecular 
sxze, (approximately 200 base pairs for PCRs i and 2 and 400 for PCR3) 
was then band purified after running on a i* agarose gel using 
standard molecular biology techniques. The isolated DNA from each 
band was dissolved in water to a final concentration of 
Ing/microlitre . 

The next stage in the generation of the Vmin is the joining of the PCR 
products from the first round of PCRs. This could be attempted in a 
number of ways, as described above, but in this case it was found that 
a two stage PCR containing all three products of the first round of 
PCRs gave a satisfactory yield of the Vmin gene product. 
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The first part of the PGR consisted of 10 cyqles in a therroocycler of 

o o * 

94 c for 1 minute and 63 C for 4 minutes, with a hot start (reaction 

mixture heated to 94°C prior to the addition of the Taq polymerase) . 
The reaction mix contained each of the Vmin gene fragments isolated 
from the first round of PCRs at a concentration of 4ng per 80 
micolitres reaction volume together with dNTPs each at a concetration 
of 200 microMolar and amplification buffer and Tag DNA polymerase as 
recommended in 'Molecular Cloning - a laboratory manual, SambrooJc, j. , 
Fritsch, E.F. & Maniatis T. Published by Cold Spring Harbor Laboratory 
Press, second Edition 1989' or the material supplier's data sheets. 
PCR conditions may need to be optimised as recommended in the book. 
This first part of the. reactions should generate some of the complete 
Vmin sequence because of the con^alementary overlaps at the ends of the 
Vmin fragments from the first PCRs. 

The second part of the PGR consisted of 25 cycles of 94°C for 1.5 
minutes followed by 55°C for 2 minutes and 72°C for 2 minutes with a 
final hold of 72°C for 9.9 minutes. Prior to "these cycles the 
reaction volume was made up to 100 microlitres by the addition of the 
two end primers, seqences 46 and 51, to a concentration of ImicroMolar 
and fresh dKTPs each to a concetration of 200 microMolar and 
amplification buffer and fresh Taq DNA polymerase as recommended in 
'Molecular Cloning - a laboratory manual, Sambrook, J., Pritsch, E.F. 
& Maniatis T. Published by Cold Spring Harbor Laboratory Press, second 
Edition 1989' or the material supplier's data sheets. PCR conditions 
may need to be optimised as recommended in the book. 
The major product of the two part PCR, with the correct molecular 
size, (approximately 800 base pairs) was then band purified after 
running on a 1% agarose gel using standard molecular biology 
techniques. The isolated Vmin DNA from the band was dissolved in 
water to a final concentration of ing/microlitre . 



* 
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4.1 



Cloning of the PCR generated Vmin gang. 



The Vmin gene product from the PCRs was the restriction enzyme cut to 
render it compatible with the plasmid expression vector. The Vmin 
gene was cut with enzymes Smal and EcoRI, following standard molecular 
biology technitjues, the suppliers recommendations, and the methods 
detailed in 'Molecular Cloning - a laboratory manual, Sambrook, j. , 
Fritsch, E.F. & Maniatis T. Published by Cold Spring Harbor Laboratory 
Press, second Edition 1989' . The gene product was cleaned of small 
restriction fragments by standard techniques (for example Geneclean 
II, from Bio 101 Inc., PO Box 2284, La Jolla, California 92038, USA). 

The E.coli plasmid vector selected for periplasmic secretion 
expression of the Vmin gene was pICI266. This vector (pICI266) with 
a single pelB leader sequence has been deposited under the Budapest 
Treaty at the National Collections of Industrial and Marine Bacteria 
Limited (NCIMB), 23 St. Machar Drive, Aberdeen, AB2 IRY, Scotland, 
U.K. The date of deposit was 11th September 1993 and the accession 
number is NCIMB 40589. A plasmid map of plC126e is shown in Figure 
38. • 

« 

Plasmid DNA was generated by standard molecular biology techniques, 
using methods detailed in 'Molecular Cloning - a laboratory manual, 
Sambrook, j. , Fritsch, E.F. & Maniatis T. Published by Cold Spring 
Harbor Laboratory Press, second Edition 1989' . Plasmid DMA was enzyme 
digested with restriction enzyme Kpnl, using the manufacturer's 
recommended conditions . The cut plasmid DMA was then treated with 
Bacteriophage T4 DNA polymerase in the presence of high concentration 
of dllTPs (2milliMolar) at 12°C (details in 'Molecular Cloning - a 
latooratory manual, Sambrook, J. , Fritsch, E.F. & Maniatis T. Published 
by Cold Spring Harbor Laboratory Press, second Edition 1989') to 
remove the protruding 3' termini generated by l^nl, and making 
compatible with the blunt end of the Smal cut Vmin. The cut plasmid 
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product was cleaned by standard techniques Cfor example Geneclean XI, 
from Bio 101 Xnc.« PO Box 2284, La Jolla, California 92038, USA) and 
then enzyme digested with restriction enzyme EcoRI, using the 
manufacturer ' s recommended conditions . The cut plasmid product was 
cleaned of small restriction fragments by standard techniques (for 
exaxnple Geneclean II, from Bio 101 Inc., PO Box 2284, La Jolla, 
California 92038, USA) . 

The Vmin gene, (treated with Smal azid EcoRI) and the pXCI226 (treated 
with Kpnl, T4 polymerase and EcoRI) were then ligated together using 
T4 DNA ligase under standard molecular, biology conditions (details in 
'Molecular Cloning - a laboratory manual, Sarabroolc, J., Fritsch, E.F. 
& Maniatis T. Published by Cold Spring Harbor Laboratory Press, second 
Edition 1969' ) . 

The ligation mix was then transformed into a suitable Ara* E.coli 
strain such as that known locally as MSD213 (strai.n MClOOO of 
Casadaban, Journal of Molecular Biology 138, 179ff (1980) ) , (F~ ara 
delta (ara-leu) delta lacX74 galV galK strR) using calcium chloride 
treated cells as detailed in 'Molecular Cloning - a laboratory manual, 
Sambrook, J. , Fritsch, E.F. & Maniatis T. Piiblished by Cold Spring 
Harbor LeUDoratory Press, second Edition 1969' . Colonies were selected 
on L agar plates containing lOmicrograms/ml of tetracycline. 

4 .2 Identification of transf ormants expressing active D1.3Vmin, 



Individual colonies from the transformation were picked into numbered 

1.5ml Eppendorf tubes containing sterile 200microlitres of L broth 

plus 0.1% arabinose plus lOmicrograms/ml tetracycline. The tubes were 

well sealed and shaken upside down at approximately 200rpm at 30^C for 

e^30Ut 16 hours . Cells in the tubes were precipitated by centifugation 

for 5 minutes in a bench top microfuge at room temperature. The 

supernatant was used immediately in ELISAs, and the cell pellet kept 
o * 

at 4 C until positive clones had been identified and could be picked 
and streaked on to fresh L agar with tetracycline plates. 



RNIQ nana TX 



wo 94/12d25 PCT/GB93/02375 

- 72 - 



For the ELISA, polystyrene microtitre plates.. were prepared by the 
addition of 10 Omicrolitres of a amg/ml solution of Chicken Egg White 
lysozyme (SIGMA L6B76) in 0.05M carbonate/bicarbonate buffer pH9.6 
(SIGMA C3141) to each well. The plates were incubated at 4°c for 
about two hours. The wells were washed three times, with a S minute 
soak between, in excess phosphate buffered saline (PBS) (SIGMA P4417) . 
The plates were then banged dry onto tissues, and 15 Omicrolitres of a 
solution of a 1% Bovine serum albumin (SIGMA A7888 for exan^jle) 
solution in PBS added to each well. The plates were incubated at 4°c 
for about 2 hours. The wells were washed three times, with a 5 minute 
soak between, in excess phosphate buffered saline (PBS) (SIGMA P4417) 
The plates were then banged dry onto tissues, and 10 Omicrolitres of 
the bacerial supematants added to the wells, keeping careful note of 
numbers and positions so that individual clones can be identified. 
The plates were incubated at 4°C for about 16 hours. The wells were 
washed three times, with a s minute, soak between, in excess phosphate 
buffered saline (PBS) (SIGMA P4417) . The plates were then banged dry 
onto tissues., and 10 Omicrolitres of 9E10 hybridoma supernatant (Evan, 
G.I., tewis G.K., Ramsay G. & Bishop J.M. Mol. Cell Biol., S, 129-136 • 
(iSBS)) at a suitable dilution (typically about 1 in 30) added to each 
well. The plates were incubated at 4°C for about 2 hours. The wells 
were washed three times, with a 5 minute soak between, in excess 
phosphate buffered saline (PBS) (SIGMA P4417) containing 0.05% Tween 
20 (SIGMA P1379) . The plates were then banged dry onto tissues, and 
10 Omicrolitres of a suitable dilution of an anti -mouse IgG labelled 
with horse-radish peroxidase (for exainple SIGMA A0944 at a dilution of 
1 in 2000) added to each well. The plates were incubated at 4°c for 
about 2 hours. The wells were washed three times, with a 5 minute 
soak between, in excess phosphate buffered saline (PBS) (SIOIA P4417) 
containing 0.05% Tween 20 (SIGMA P1379) . The plates were then banged 
dry onto tissues, and lOOmicrolitres of a solution of 30mg 
o-phenylenediamine dihydrochloride (OPD) dissolved in 

phosphate-citrate buffer with sodium perborate (SIGMA P4922) added to 
each well. The plates are then incubated in the dark at room ' 
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temperature until the colour develops, SOmiqrolitres of a solution of 
25% sulphuric acid is added to each well to stop colour development. 
The absorbance of each well is read at 490nm. Positive wells are 
identified as having colour development more than twice background - 

The results of an assay are shown in Figure 48 as raw data from the 
plate reader. On this plate rows A and B were duplicate doubling 
dilutions of a positive D1.3-Fab-cmyc construct, which was also 
included as sample numbers 3 and 34 (positions C3 and F3 on plate) . 
Buffer blanks are included at positions CI, ElO, Fl and HIO, 
Bacterial controls are included at positions Ell, E12, Hll and H12, 
Picked colonies, numbered 1 to 64 (including positive controls 3 and 
34) are placed in the other wells starting at C2 and going from left 
to right and down the plate to H9. Thus the two positive identified 
clones from this plate, F6 (CD 0.540) and G4 (OD 0.398), correspond to 
colonies 37 and 47. The purified sequenced plasmid from 37 was later 
designated pJFH3037 (see below) . 

A number of positive wells were identified and the clones which led to 
them were grown and used to generate plasmid DNA by standard molecular 
biology methods (details in 'Molecular Cloning - a laboratory manual, 
Sambrook, J., Fritsch, E.F. & Maniatis T. Published by Cold Spring 
Harbor LaQaoratory Press, second Edition 1989'). The purified DNA was 
sequenced around the D1.3 Vtnin insert, and a number of them were found 
to have the Vmin sequence shown in Sequence 53. One of these plasmids 
was designated pJPH3037, and a plasmid map is shown in figure 39. 

4.3 Generat ion of E.coli cell pastes containing Dl.3 V^min 

E. coli strain MSD 213 was transformed with plasmid paFH3 037. The 
resultant strain MSD 215 (pJFH3037) was purified and maintained in 
glycerol stocks at -eOoC. Aliquots of each culttire were removed from 
stock and streaked onto agar plates of L- tetracycline to give separate 
single colonies after overnight growth at 37oC. A single colony of 
each culture was removed and suspended separately in 10ml 
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L- tetracycline broth and lOOul inrnediately i^culated into each of 
fifteen 250 ml Erlenmeyer flasks containing 75 ml of L- tetracycline 
broth supplemented with 0.1% w.v L-arabinose. After growth for ISh a 
3 0oC on a reciprocating shaker the bacteria were harvested in IL 
centrifuge bottles in a Sorvall RC-3B centrifuge (7000 x g, 4oC, 30 
min) . 



Purification and aeouenee identification of y-min m -» 
(converaatlw ) with r-mvc tag 

Pellets (frozen at -80°; from 4.3) of E.coli cell paste were thawed, 
resuspended in lysis buffer (SOmM Tris; 50mM EDTA, 15% Sucrose; imn' 
Benzamidiae. pHB.O) at 4° and 800m1 of 2.5% sodium dodecyl sulphate 
(SDS) solution was added for every 25 ml of the suspension. The 
mixture was left on ice for 30 min with occasional agitation, 
deoxyribonuclease added, mixed and the suspension was left on ice for 
a further 30 min with occasional agitation. The cell suspension was 
divided into 45 ml aliquots which were sonicated on ice for 12 x 30s 
bursts with 3 OS intervals between bursts. Sonicated suspensions were 
centrifuged at 13,000 rpm for 40 min at 4° and the supernatant 
collected. Phenyl methylsulphonyl Fluoride (PMSP) was added to the 
supernatant pool to give a imM concentration followed by dialysis 
overnight at 4° into phosphate buffered saline (PBS), pH 7.3. The 
dialysed supernatant was filtered through a 0.22Mm membrane, loaded on 
to an immobilised lysozyme column (CNBr-activated Sepharose 4B TM 
matrix based) equilibrated at 4° in PBS, pH 7.3. The column was 
washed with PBS and bound material was eluted by somM diethylamine , pH 
9.S. The successful binding of the D. 3 v-min to its immobilised 
antigen confirms maintenance of appropriate CDR conformations found in 
the parent DX.3 antibody. Eluted peak fractions were pooled, 
tri (hydroxymethyl) aminomethane (TRIS) added to give a lOOmM 
concentration and pH adjusted to around pH 7.5 and sterile filtered. 
Further purification, utilising known methods such as ion exchange or 
permeation chromatography may be performed if required. 
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A sample of the material was analysed by SDSrPAGE using a 12% 
TRIS -Glycine gel which was electro-blotted on to a nitrocellulose 
membrane followed by Western blot analysis using an anti- c-myc tag 
antibody (9E10) . The relevant band on a Coomassie stained 
nitrocellulose blot was sequenced by an automated protein/peptide 
sequencer using the Edman degradation technique. A positive match 
with the expected N-terminal sequence of D-Y-P-G-S-S-G-S-R-L was 
obtained (D-y-P-g-?-?-G-?-R-L-) thus confirming identity between the 
isolated protein and gene sequences. 

^XAMPl^K 5 Intra -cellular E .coli expression of Dl-3 Vmin f gnnservative 

In a similar fashion to the generation of the secreted Dl,3 Vmin gene 
detailed in Example 4, a Vmin gene was made by PGR for cloning into an 
inta-cellular E.coli expression vector. In this case the PGR primer 
shovm in sequence 52 was used in place of the primer sequence 46, in 
the first round PGR reaction with primer sequence 49, anri in the 
joining of the three first round PGR products together with primer 
sequence 51 in the second round PGR. The Vmin gene product then has 
the 5' sequence of primer sequence 52 at the start of the Vmin gene, 
and this provides an Hpal restriction site (GTTAAC) and sequence 
comprising RWWWWWWWW (where R is A or G, and W is A or T) preceding an 
ATG (methionine) start codon and the Vmin gene. The RWWWWWWWW 
generates a number of sequences that can act as ribosome binding 
sites, and effective constructs, which express the gene from a 
promoter in a suitable vector, can be selected by looking at 
expression levels. 

This Dl-3 Vmin gene for intra -cellular expression was restriction 

enzyme cut with Hpal and BamHI . An expression vector such as pICIOOSO 

(described in section 2.5.6.1) is rised for cloning of the Vmin. In 

this case the pIGIOOSO plasmid DMA is restriction cut with enzyme 

Kpnl , and then treated with T4 DNA polymerase to remove the 3 ' 

♦ 

protruding termini, in a similar manner to that described for plasmid 
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P1C1266 (section 4.1) . After cleaning the Kpnl cut/T4 DNA polymerase 
treated pIClOOSO is restriction digest cut with enzyme Bglll and the 
plasmid DNA cleaned as described. Restriction enzymes Bgin and BamHl 
generate identical cohesive ends, and the D1.3Vmin gene (BamHI and 
Hpal cut) is then ligated with pICIOOBO (Kpnl/T4 DNA polymerase and 
Bglll treated) . as described. This inserts the Vmin gene with a 
variety of potential ribosome binding sites under the control of the 
pTrp promoter of the pICIOOBO vector. 

The ligation mix is used to transform a range of E.coli host strains, 
some of which are protease deficient, to assess accumulation in 
different genetic backgrounds . The methods for expression studies are 
essentially the same as those presented in section 2.7, except that 
the 42°C induction is not required in the pTrp promoter system of 
PIClOOSO. Protein accumulation may be affected by the temperature of 
the growth of the culture, and so a range of temperatures are tested 
for each isolate. Isolates found to give positive bands in the 
Coomassie-blue stained protein gels and/or by Western, detection of the 
c-myc TAG sequence using antibody 9B10, are then used to generate 
plasmid DNA, This is sequenced around the vmin gene to check and 
confirm the correct nucleic acid sequence of the gene insert, and to 
identify the specif c ribosome binding site sequence, in one example 
AAAATATTT (from the RWWWWWWW possibilities) . Vmin produced by the 
intra-cellular expression system may then be extracted, isolated and 
renatured as described in section 2.8, and the activity tested in 
systems as described in section 4.2 for an ELISA binding to lysozyme. 

SSMffLKjS Vmin Con structs for fifdl) 
Ant i- Lysozyme antibody 

Fischmami T.O., Bentley G.A., Bhat T.N. , Boulot G. , Mariuzza R.a. , 

Phillips S.E.V., Tello D., Poljak R.j. 

Joximal, of Biological Chemistry 1991 266 12915. 
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^•^ a) h65-ii5/hi3-64/ll-i0B Intact, Light Chain 

Utilising methods set out hereinbefore the V-min construct in Figure 
41 was produced. 



^•2 b) 161-l67/ll3-60/hl-116 Intact Heavy Chain D1.3 

Utilising methods set out hereinbefore the V-rain construct in Figure 
42 was produced. 

^•^ c) h87-ll5/hl3-64/ll-7/121-61/l76«l08 Minimal Construct 

D1.3 



Utilising methods set out hereinbefore the V-min construct in Figure 
43 was produced. 

g^CMiPLE 7 Vmin Constructs for KOL (2fb4) 
Antibody from Human Myeloma patient KOL 

Kratzin H.D., Palm W. , Stangel M. , Schmidt W.E., Friedrich J., 
Hilschman N. 

Biological Chemistry Hoope-Seyler 1989 370 263. 

Marguart M., Deisenhofer J., Huber R. , Palm W. 
Journal Of Molecular Biology 1980 141 369. 

7-1 a) h67-i25/hl3-66/ll.ll2 Intact Light Chain KOL 



Utilising methods set out hereinbefore the V-min construct in Figtire 
44 was produced. 
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"^•2 b) 162-109/112-61/hl-125 Intact Heavy Chain 



KOL 



Utilising methods set out hereinbefore the V-min construct in Figure 
45 was produced. 



^>XMSPLS a Vmin Constructs for oac-l 



Antibody directed against Integrin Ilbllla 

Tomiyama Y. , Brojer E. , Rugger 2.M., Shattil S.J., Smiltneck J., 
Gorski J. , 

Kumar A., Kieber-Enmions T., Kunicki T.J. 

Journal of Biological Chemistry 1992 267 18085-18092. 

^> h66.ii8/hl3-€5/li-ll2 Intact Light Chain PAC-l 

Utilising methods set out hereinbefore the V-min consti^ct in Figure 

46 was produced. 

^> l66-lll/li2-65/hl-l23 Intact Heavy Chain PAC-i 

Utilising methods set out hereinbefore the V-min construct in Figure 

47 was produced. 
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SEOtJENCB LISTING 
(2) INFORMATION FOR SEQ ID NO: 1 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 25 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATC GAT GAA TTC CTG CAG CAT ATG A 25 
(2) INFORMATION FOR SEQ ID NO: 2 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 62 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 
( D } TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTA CTG CTA TCT ACT ACT GTG CAC GTA ACT ACT ATG GCT CCA CTT 
GOT ACT TCG ACG TTT GG 
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(2) INFORMATION FOR SEQ ID NO: 3 



(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 59 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: LdLlxear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GGT GCT GGT ACC ACT GTA GGC GGC TCC CTC CGT CTG TCT TGC GCA 45 
ACT AGT GGC TTC 
AC 59 



(2) INFORMATION FOR SEQ ID NO : 4 
(i) SEQUENCE CHARACTER! S TXjCS : 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CTT CTC TGA CTT CTA CAT GGA GTG GGT ACG CCA GCC GCC GGG TAA 45 
ACG TCT CGA GTG GAT 60 
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(2) INFORMATION FOR SEQ ID NO: 5 



(i) 



SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 61 bases 



(B) SEQUENCE TYPE: Nucleotide 




(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CGC AGC TAG CCG TAA CAA AGG TAA CAA GTA TAC CAC CGA ATA CAG 



(2) INFORN!ATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERXSTXCS : 

(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTC TQA CCC AGT CTA TGT CTT GCA AAT CTT CTC AGT CTC TGC TGA 
ACT CTG GTA ACC AGA A 61 



CGC TTC 



TGT TGA TAT C 



61 



BNSDOCfO: <Wa 



.9412e25A2_l_> 



wo 94/12625 



83 



PCT/GB93/02375 



(2) INFORMATION FOR SEQ ID NO : 7 



(i) SEQUENCE CHARACTERISTICS 



(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: LdLnear 



(xi) SEQX7ENCE DESCRIPTION: SEQ ID NO: 7: 



AAA CTT CCT CGC GTG GTA TCA GCA AAA GCC TGG CCA GCC GCC GAA 45 
ACT CCT GAT CTA CGG T 61 



(2) INFORMATION FOR SEQ ID NO: 8 



(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 51 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



GCG TCT ACG CGT GAA TCT GGT GTT CCG GAC CGT TTC ACC GAA GAC 45 
CTG GCT 51 
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(2) INFORMATION FOR SEQ ID NO : 9 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEONESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTT TAC TAC TGT CAA AAT GAT CAC TCT TAG CCG CTG ACC TTT GGC 
GCC GGC ACC AAA CTG 60 

(2) INFORMATION FOR SEQ ID NO: 10 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 52 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAA CAG AAA CTG ATC TCT GAA GAA GAC CTG AAC TAA TAA TAC GTA 
AGA TCT G 52 
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(2) INFORMATION FOR SEQ ID NO: 11 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 51 bases 

(B) SEQUENCE TYPE; Nucleotide 

(C) STRANDEDNESS : Single 

( D) TOPOLOGY : Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAT CCA GAT CTT ACQ TAT TAT TAG TTC AGG TCT TCT TCA GAG ATC 45 
AGT TTC 51 

(2) INFORMATION FOR SEQ ID NO: 12 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 59 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

■ 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TGT TCC AGT TTG GTG CCG GCG CCA AAG GTC AGC GGG TAA GAG TGA 
TCA TTT TGA CAG TA 



45 
59 
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(2) INFORMATION FOR SEQ ID NO: 13 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 62 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTA AAC AGO CAG GTC TTC GGT GAA ACG GTC CGG AAC ACC AGA TTC 
ACG CGT AGA CGC ACC GT 62 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AGA TCA GGA GTT TCG GCG GCT GGC CAG GCT TTT GCT GAT ACC ACG 
CGA GGA AGT TTT TCT 60 • 
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(2) INFORMATION FOR SEQ ID NO: 15 



(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 53 bases 




(c) stran: 



NESS : Single 



(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGT TAG CAG AGT TCA GCA GAG ACT GAG AAG ATT TGC AAG ACA TAG 



(2) INFORMATION FOR SEQ ID NO: 16 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATG ACG ATA TCA ACA GAA GCG CTG TAT TCG GTG GTA TAC TTG TTA 
CCT TTG TTA CGG CTA 60 



ACT GGG TC 



53 
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(2) INFORMATION FOR SEQ ID NO : 17 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCT GCG ATC CAC TCG AGA CGT TTA CCC GGC GGC TGG CGT ACC CAC 45 
TCC ATGTAG AAG TCA G 61 

(2) INFORMATION FOR SEQ ID NO : 18 
(i) SEQUENCE CHARACTERISTICS: 

■ 

(A) SEQUENCE LENGTH: 58 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGA AGG TGA AGC CAC TAG TTG CGC AAG ACA GAC GGA GGG AGC CGC 45 
CTA CAG TGG TAC C 

58 



9412625A2_I _> 



wo 94/12625 



- 89 



PCT/GB93/02375 



(2) INFORMATIOK FOR SEQ ID NO : 19 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 63 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGC ACC CCA AAC GTC GAA GTA CCA AGT GGA GCC ATA GTA GTT 'ACG 
TGC ACA GTA GTA GAT AGC 63 

(2) INFORMATION FOR SEQ ID NO: 20 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 34 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AGT ACT CAT ATG CTG CAG GAA TTC ATC GAT GTA C 



34 
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(2) INFORMATION FOR SEQ ID -NO: 21 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE liENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTA CTG GTA CTT ACT ACT GCA CTA CTC GCT TTG CAT ATT GGG GCC 45 
AAG GTA CTC TGG TTG 60 



(2) INFORMATION FOR SEQ ID NO: 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 59 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTG GTT CCA TGA AAC TCT CTT GCG CTG CCT CAG GCT TCA CTT TTA 45 
GCG ACG CCT GGA TG 

59 
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(2) INFORMATION FOR SEQ ID NO: 23 
(i) SEQUENCE CHARACTERISTICS: 



(A) SEQX7ENCE LENGTH: 58 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GAC TGG GTC CGT CAG TCT CCG GAG AAA GGT CTG GAA TGG GTG GCC 45 
GAA ATT GGC AAC A 5q 



(2) INFORMATION FOR SEQ ID NO: 24 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 46 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAG GTA ACA ACC ACG CAA CCT ACT ATG CTG AGT CTG TTG ATA TCA 45 
A 46 



A 



% 

t 
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(2) INFORMATION FOR SEQ ID NO : 25 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 47 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAT GAC CCA GTC TAC TAT CAC GTG TAA GGC TAG CCA GGA CAT CAA 

* a 

CA 47 . 

(2) INFORMATION FOR SEQ ID NO: 26 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 55 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GCT ACC TGA GCT GGT TCC AGC AGA AAC CCG GGA AAT CTC CGA AAA 
CAT TAA TCT A 
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(2) INFORMATION FOR SEQ ID NO: 27 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 53 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



CCG TGC AAA CCG TCT GGT TGA TGG TGT ACC CTC GAG GTT CTC TGA 45 
AGA TAT GGG TAT TT *^ 59 



(2) INFORiMATION FOR SEQ ID NO: 28 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 56 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ACT ACT GTC TAC AGT ATG ACG AGT TCC CGC GTA CCT TCG GTG GCG 45 
GCA CCA AGC TT 56 

4 
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(2) INFORMA.TION FOR SEQ ID NO : 29 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 55 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TGT TCA AGC TTG GTG CCG CCA CCG AAG GTA CGC GGG AAC TCG TCA 
TAC TGT AGA C 



(2) INFORMATION FOR SEQ ID NO: 30 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

AGT AGT AAA TAC CCA TAT CTT CAG AGA ACC TCG AGG GTA CAC CAT 
CAA CCA QAC GGT TTG 60 
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(2) INFORMATION FOR SEQ ID NO: 31 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQXJENCE LENGTH: 56 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CAC GGT AGA TTA ATG TTT TCG GAG ATT TCC CGG GTT TCT GCT GGA 45 
ACC AGC TCA GG 56 

(2) INFORMATION FOR SEQ ID NO: 32 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQX7ENCE LENGTH: 45 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TGA CTG TTG ATG TCC TGG CTA GCC TTA CAC GTG ATA GTA GAC TGG 45 
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(2) INFORMATION FOR SEQ ID NO : 33 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQX7ENCE LENGTH: 48 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTC ATT TTG ATA TCA ACA GAC TCA GCA TAG TAG GTT GCG TGG TTG 
TTA Aft 



(2) INFORMATION FOR SEQ ID NO: 34 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 56 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY; Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CCT TTG TTG CCA ATT TCG GCC ACC CAT TCC AGA CCT TTC TCC GGA 
GAC TGA CGG AC 



45 
56 
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(2) INFORMATION FOR SEQ ID NO: 35 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQOEINCE LENGTH: 6 0 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CCA GTC CAT CCA GGC GTC GCT AAA AGT GAA GCC TGA GGC AGC GCA 4! 
AGA GAG TTT CAT GGA 60 

(2) INFORMATION FOR SEQ ID NO: 36 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ACC ACC AAC CAG AGT ACC TTG GCC CCA ATA TGC AAA GCG AGT AGT 45 
GCA GTA GTA AGT ACC 60 
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(2) INFORMATION FOR SEQ ID NO : 3 7 
(i) SEQUENCE CHARACTERISTICS; 

(A) SEQUENCE LENGTH: 152 bases 

(B) SEQUENCE TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT AGTTAACTAG 60 
TACGCAAGTT CACGTAAAAA GGGTATCGAC AATGGTACCC GGGGATCCTC TAGAGTCGAC 120 
CTGCAGGCAT GCAAGCTTAG CCCGCCTAAT GAGCGGGCTT TTTTTTATCG AC 152 



(2) INFORMATION FOR SEQ ID NO: 38 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 66 bases 

(B) SEQX3ENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Double (top strand listed) 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60 
CATGCA SS 
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(2) INFORMATION FOR SEQ ID NO; 3 9 



(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 210 amino acids 

(B) SEQX7HNCE TYPE: amino acid 
(D) TOPLOGY: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Asp lie Val Met Thr Gin Ser Pro Ser Ser Leu Ser Val Ser Ala Gly 

1 5 10 ' 15 

Glu Arg Val Thr Met Ser Cys Lys Ser Ser Gin Ser Leu Leu Asn Ser 

20 25 30 

Gly Asn Gin Lys Asn Phe Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin 

35 40 45 

Pro Pro Lys Leu Leu lie Tyr Gly Ala Ser Thr Arg Glu Ser Gly Val 

50 55 60 

Pro Asp Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 
€5 70 75 80 

lie Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn 

85 90 95 

Asp His Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu lie 

100 105 110 

Lys Arg Ala Asp Ala Ala Pro Thr Val Ser lie Phe Pro Pro Ser Ser 

105 110 115 

Glu Gin Leu Thr Ser Gly Gly Ala Ser val Val Cys Phe Leu Asn Asn 

120 125 130 

Phe Tyr Pro Lys Asp lie Asn Val Lys Trp Lys lie Asp Gly Ser Glu 

135 140 145 150 

Arg Gin Asn Gly Val Leu Asn Ser Trp Thr Asp Gin Asp Ser Lys Asp 

155 160 165 



NSOOCID: <WO 



9412625A2 1 > 



BNS oaae 101 



wo 94/12625 

Ser Thr Tyr Ser Met 

170 

Glu Arg His Asn Ser 
185 

Ser Pro lie Val Lys 
200 
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Ser Ser Thr Leu Thr Leu 

175 

Tyr Thr Cys Glu Ala Thr 
190 

Ser Phe Asn Arg Asn Glu 
205 



PCT/GB93/02375 

Thr Lys Asp Glu Tyr 
180 

His Lys Thr Ser Thr 
195 

Cys 
210 



(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 22 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 



Asp lie Lys Met 
1 

Glu Arg Val Thr 

20 

Leu Ser Trp Phe 
35 

Az-g Ala Asn Arg 
50 

Gly Ser Gly Gin 
65 

Asp Met Gly lie 

Phe Gly Gly Gly 

100 

Thr Val Ser lie 
115 

Ala Ser Val Val 
130 



Thr Gin Ser Pro 
5 

lie Thr Cys Lys 

Gin Gin Pro Gly 

40 

Leu Val Asp Gly 
55 

Asp Tyr Ser Leu 
70 

Tyr Tyr Cys Leu 
85 

Thr Lys Leu Glu 

Phe Pro Pro Ser 

120 

Cys Phe Leu Asn 
135 



Ser Ser Met Tyr 
10 

Ala Ser Gin Asp 
25 

Lys Ser Pro Lys 

Val Pro Ser Arg 

60 

Thr lie Ser Ser 
75 

Gin Tyr Asp Glu 
90 

Asn Lys Arg Ala 
105 

Ser Glu Gin Leu 

Asn Phe Tyr Pro 

140 



Ala Ser Leu Gly 
15 

lie Asn Ser Tyr 
30 

Thr Leu lie Tyr 
45 

Phe Ser Gly Ser 

Leu Glu Tyr Glu 

80 

Phe Pro Arg Thr 
95 

Asp Ala Ala Pro 
110 

Thr Ser Gly Gly 
125 

Lys Asp He Asn 
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Ala Lys Trp Lys lie 
145 

Ser Trp Thr Asp Gin 

165 

Thr Leu Ser Leu Thr 

180 

Tyr Thr Cys Glu Ala 
195 

Ser Phe Asn Arg Asn 
210 



Asp Gly Ser ,Glu Arg Gin 
150 155 
Asp Ser Lys Asp Ser Thr 

170 

Leu Thr Lys Asp Glu Tyr 

185 

Thr His Lys Thr Ser Thr 
200 

Glu Cys 
220 



Asn Gly Val Leu Asn 

160 

Tyr Ser Met Ser Ser 

175 

Glu Arg His Asn Ser 
190 

Ser Pro lie Val Lys 
205 



(2) lOTORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 



Glu Val Lys Leu 
1 

Ser Leu Arg Leu 

20 

Tyr Met Glu Trp 
35 

Ala Ala Ser Arg 
50 

Ser Val Lys Gly 
65 

Leu Tyr Leu Gin 
* 

Tyr Cys Ala Arg 

100 



Val Glu Ser Gly 
5 

Ser Cys Ala Thr 

Val Arg Gin Pro 

40 

Asn Lys Gly Asn 
55 

Arg Phe lie Val 
70 

Met Asn Ala Leu 
85 

Asn Tyr Tyr Gly 



Gly Gly Leu Val 
10 

Ser Gly Phe Thr 
25 

Pro Gly Lys Arg 

Lys Tyr Thr Thr 

60 

Ser Arg Asp Thr 
75 

Arg Ala Glu Asp 
90 

Ser Thr Trp .Tyr 
105 



Gin Pro Gly Gly 
15 

Phe Ser Asp Phe 
30 

Leu Glu Trp lie 
45 

Glu Tyr Ser Ala 

Ser Gin Ser lie 

80 

Thr Ala lie Tyr 
95 

Phe Asp Val Trp 
105 
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Gly Ala Gly Thr Thr Val Thr Val Ser Ser Glu Ser Ala Arg Asn Pro 

110 120 125 

Thr lie Tyr Pro Leu Thr Leu Pro Pro Ala Leu Ser Ser Asp Pro Val 
"0 135 3^40 

He He Gly Cys Leu He His Asp Tyr Phe Pro Ser Gly Thr Met Asn 

ISO 155 leo 

Val Thr Trp Gly Lys Ser Gly Lys Asp He Thr Thr Val Asn Phe Pro 

1^5 170 175 

Pro Ala Leu Ala Ser Gly Gly Arg Tyr Thr Met Ser Asn Gin Leu Thr 

180 185 190 

Leu Pro Ala Val Glu Cys Pro Glu Gly Glu Ser Val Lys Cys Ser Val 

195 200 205 

Gin His Asp Ser Asn Pro Val Gin Glu Leu Asp Val Asn Cys 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: 



(xi) SEQUENCE DESCRIPTION SEQ ID NO: 42 

Gin Val Lys Leu Glu Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

^ 5 10 15 

Ser Met Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asp Ala 

20 25 30 

Trp Met Asp Trp Val Arg Gin Ser Pro Glu Lys Gly Leu Glu Trp Val 

35 40 45 

Ala Glu He Gly Asn Lys Gly Asn Asn His Ala Thr Tyr Tyr Ala Glu 

50 55 60 

Ser Val Lys Gly Arg Phe Thr Val Ser Arg Asp Thr Ser Lys Ser Arg 
" • 70 75 . 80 
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Val Tyr Leu Gin Met Asn Ser Leu Arg Val Glu Asp Thr Gly Thr Tyr 

85 90 95 

Tyr Cys Thr Thr Arg Phe Ala Tyr Trp Gly Gin Gly Thr Leu Val Thr 

100 105 110 

TVal Ser Ala Ala Lys Thr Thr Pro Pro Ser Val Tyr Pro Leu Ala Pro 

115 120 125 

Gly Ser Ala Ala Gin Thr Asn Ser Met Val Thr Leu Gly Cys Leu Val 

130 135 140 

Lys Gly Tyr Phe Pro Glu Pro Val Thr Val Thr Trp Asn Ser Gly Ser 
145 ISO 155 160 

Leu Ser Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Asp Leu 

165 170 175 

Tyr Thr Leu Ser Ser Ser Val Thr Val Pro Ser Ser Thr Trp Pro Ser 

180 185 190 

Glu Thr Val Thr Cys Asn Val Ala His Pro Ala Ser Ser Thr Lys Val 

195 200 205 

Asp Lys Lys lie Val Pro Arg Asp Cys 
210 215 



(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

t 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Thr Ala lie Tyr Tyr Cys Ala Arg Asn Tyr Tyr Gly Ser Thr Trp Tyr 

15 10 15 

Phe Asp Val Trp Gly Ala Gly Thr Thr Val Gly Gly Ser Leu Arg Leu 

20 25 30 

Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Asp Phe Tyr Met Glu Trp 
35 An Aq 
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Val Arg Gin Pro Pro Gly Lys Arg Leu Glu Trp lie Ala Ala Ser Arg 

SO 55 60 

Asn Lys Gly Asn Lys Tyr Thr Tlir Glu Tyr Ser Ala Ser Val Asp ,Ile 
^5 70 75 80 

Val Met Thr Gin Ser Met Ser Cys Lys Ser Ser Gin Ser Leu Leu Asn 

85 90 35 

Ser Gly Asn Gin Lys Asn Phe Leu Ala Trp Tyr Gin Gin Lys Pro Gly 

100 105 110 

Gin Pro Pro Lys Leu Leu lie Tyr Gly Ala Ser Thr Arg Glu Ser Gly 

115 120 125 

Val Pro Asp Arg Phe Thr Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn 

130 135 140 

Asp His Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Gin 
1*5 ISO 155 160 

Lys Leu lie Ser Glu Glu Asp Leu Asn 

165 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 



Thr Gly Thr Tyr 
1 

Thr Leu Val Gly 

20 

Thr Phe Ser Asp 
35 

Gly Leu Glu Trp 
50 



Tyr Cys Thr Thr 
5 

Gly Ser Met Lys 
Ala Trp Met Asp 

40 

Val Ala Glu lie 
55 



Arg Phe Ala Tyr 
10 

Leu Ser Cys Ala 
25 

Trp Val Arg Gin 

Gly Asn Lys Gly 

60 



Trp Gly Gin Gly 
15 

Ala Ser Gly Phe 
30 

Ser Pro Glu Lys 
45 

Asn Asn His Ala 
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Thr Tyr Tyr Ala Glu Ser Val Asp He Lys Met Thr Gin Ser He Thr 

70 75 80 

Cys Lys Ala Ser Gin Asp He Asn Ser Tyr Leu Ser Trp Phe Gin Gin 

85 90 95 

Lys Pro Gly Lys Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu 

100 105 110 

Val Asp Gly Val Pro Ser Arg Phe Ser Glu Asp Met Gly He Tyr Tyr 

115 120 125 

Cys Leu Gin Tyr Asp Glu Phe Pro Arg Thr Phe Gly Gly Gly Thr Lys 
130 135 

Leu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn End 
145 150 

(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQT3ENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: 



(xi) SEQUENCE DESCRIPTION SEQ ID NO: 45 



1 


Ser 


Arg Leu Ser He Ser Lys Asp Asn Ser 


Lys Ser Gin Val Phe 


Leu 


16 


17 


Lys 


Met Asn Ser Leu His Thr Asp Asp Thr 


Ala Arg Tyr Tyr Cys 


Ala 


32 


33 


Arg 


Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr 


Leu 


48 


49 


Thr 


Val Ser Ala Pro Ser Gin Ser Leu Ser 


He Thr Cys Thr Val 


Ser 


64 


65 


Gly 


Phe Ser Leu Thr Gly Tyr Gly Val Asn 


Trp Val Arg Gin Pro 


Pro 


80 


81 


Gly Lys Gly Leu Glu Trp Leu Gly Met He 


Trp Gly Asp Gly Asn 


Thr 


96 


97 


Asp 


Tyr Asn Ser Ala Leu Lys Asp He Gin 
• 


Met Thr Gin Ser Pro 


Ala 


112 


113 


Ser 


Leu Ser Ala Ser Val Gly Glu Thr Val 


Thr He Thr Cys Arg 


Ala 


128 


129 


Ser 


Gly Asn He His Asn Tyr Leu Ala Trp 


Tyr Gin Gin Lys Gin 


Gly 


144 
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192 



14 5 Lys Ser Pro Gin L,eu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly 160 
161 Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu 176 
177 Lys lie Asn Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cys Gin 
193 His Phe Trp Ser Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu 208 
209 lie Lys Arg 211 
(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 46: 
1 GACTATCCCGGGTCTTCTGGCTCCCGTCTGAGCATCAGC 39 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 47: 
1 GGCACCACGCTGACCGTCTCCGCGCCCTCACAGAGCCTGTCCATC 45 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(xi) SEQUENCE DESCRIPTION SEQ ID NO: 48: 
CAGACTATAATTCAGCTCTCAAAGACATCCAGATGACCCAGTCTCC 4 6 
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(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45. nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 49: 

1 GATGGACAGGCTCTGTGAGGGCGCGGAGACGGTCAGCGTGGTGCC 
(2) INFORMATION FOR SEQ ID NO: 50: 

» 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 50: 
1 GGAGACTGGGTCATCTGGATGTCTTTGAGAGCTGAATTATAGTCTG 

(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS : single. 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 51: 
1 CGACGGCCAGTGAATTCGAGCTGG 24 
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(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 52: 

1 GCAACTAGTTAACTAGTACGCAAGTTCACGTAAAAAGGRWWWWWWWWATGTCCCGTCTGA 60 
6 1 GCATCAGCAAGG 7 2 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION SEQ 

1 GAC TAT CCC GGG TCT TCT GGC TCC 

1 Asp Tyr Pro Gly Ser Ser Gly Ser 

4 9 TCC AAG AGC CAA GTT TTC TTA AAA 

17 Ser Lys Ser Gin Val Phe Leu Lys 

97 ACA GCC AGG TAC TAC TGT GCC AGA 

33 Thr Ala Arg Tyr Tyr Cys Ala Arg 

145 TGG GGC CAA GGC ACC ACG CTG ACC 

4 9 Trp Gly Gin Gly Thr Thr Leu Thr 

193 TCC ATC ACA TGC ACC GTC TCA GGG 

65 Ser lie Thr Cys Thr Val Ser Gly 



ID NO: 53: 

CGT CTG AGC ATC AGC AAG GAC AAC 48 

Arg Leu Ser He Ser Lys Asp Asn 16 

ATG AAC AGT CTG CAC ACT GAT GAC 96 

Met Asn Ser Leu His Thr Asp Asp 32 

GAG AGA GAT TAT AGG CTT GAC TAC 144 

Glu Arg Asp Tyr Arg Leu Asp Tyr 48 

GTC TCC GCG CCC TCA CAG AGC CTG 192 

Val Ser Ala Pro Ser Gin Ser Leu 64 

TTC TCA TTA ACC GGC TAT GGT GTA 240 

Phe Ser Leu Thr Gly Tyr Gly Val 80 
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241 AAC TGG GTT CGC GAG CCT CCA GGA AAG GGT CTG GAG TGG CTG GGA ATG 288 

81 Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met 96 

28 9 ATT TGG GGT GAT GGA AAC ACA GAC TAT AAT TCA GCT CTC AAA GAC ATC 336 

97 He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Asp He 112 

337 CAG ATG AGO GAG TCT CCA GCC TCC CTT TCT GCG TCT GTG GGA GAA ACT 384 

113 Gin Met Thr Gin Ser Pro Ala Ser Leu Ser Ala Ser Val Gly Glu Thr 128 

385 GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT CAC AAT TAT TTA GCA 432 

129 Val Thr He Thr Cys Arg Ala Ser Gly Asn He His Asn Tyr Leu Ala 144 

433 TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG CTC CTG GTC TAT TAT 480 

145 Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin Leu Leu Val Tyr Tyr 160 



225 Glu Glu Asp Leu Asn 



721 CGA ATT CAC TGG CCG TCG 73 8 



481 ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG TTC AGT GGC AGT GGA 

161 Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 176 

529 TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC CTG CAA CCT GAA GAT 576 

177 Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser Leu Gin Pro Glu Asp 192 

577 TTT GGG AGT TAT TAG TGT CAA CAT TTT TGG AGT ACT CCT CGG ACG TTC 624 

193 Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser Thr Pro Arg Thr Phe 208 

625 GGT GGA GGG ACC AAG CTC GAG ATC AAA CGG GAA CAA AAA CTC ATC TCA 672 

209 Gly Gly Gly Thr Lys Leu Glu He Lys Arg Glu Gin Lys Leu He Ser 224 

673 GAA GAG GAT CTG AAT TAA TAA TGA TCA AAC GGT AAT AAG GAT CCA GCT 720 



240 
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(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 244 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOI-OGY: linear 

(xi) SEQXJENCE DESCRIPTION SEQ ID NO: 54 



1 Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu 



17 Ala Gin Pro Ala Met Ala Ser Arg Leu Ser lie Ser Lys Asp 
3 3 Lys Ser Gin Val Phe Leu Lys Met Asn Ser Leu His Tiir Asp 



Leu Ala is 



Asn Ser 32 



Asp Thr 48 



49 Ala Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp 
65 Gly Gin Gly Thr Thr Leu Thr Val Ser Ala Pro Ser Gin Ser Leu Ser 



•64 



80 



81 lie Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val Asn 96 



97 Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly 



Met lie 112 



113 Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys 



Asp lie Gin 128 



129 Met Thr Gin Ser Pro Ala Ser Leu Ser Ala Ser Val Gly Glu 



Thr Val 144 



145 Thr He Thr Cys Arg Ala Ser Gly Asn He His Asn Tyr Leu Ala Trp 160 



• 

161 Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin Leu Leu Val 



Tyr Tyr Thr 176 
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177 Thr.Thr Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser 192 

193 Gly Thr Gin Tyr Ser Leu Lys He Asn Ser Leu Gin Pro Glu Asp Phe 2 08 

209 Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser Thr Pro Arg Thr Phe Gly 224 

225 Gly Gly Thr Lys Leu Glu He Lys Arg Glu Gin Lys Leu He Ser Glu 240 

241 Glu Asp Leu Asn 244 

(2) INFORMATION FOR SEQ ID NO: 55 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 55: 

Ser Arg Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu 

15 10 15 

Met Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg 

20 25 30 

Glu Airg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Tyr Thr Leu Thr 

35 40 45 

Val Ser Ala Pro Ser Gin Ser Leu Ser He Thr Cys Thr Val Ser Gly 

50 55 «0 

Phe Ser Leu Thr Gly Tyr Gly Val Asn Trp Val Arg Gin Pro Pro Gly 
65 70 75 80 

Lys Gly Leu Glu Trp Leu Gly Met He Trp Gly Asp Gly Asn Thr Asp 

85 90 95 

Tyr Asn Ser Ala Leu Lys Asp He Gin Met Thr Gin Ser Pro Ala Ser 
100 105 110 
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Leu Ser Ala Ser Val Gly Glu Thr 

115 120 
Gly Asn He His Asn Tyr Leu Ala 
130 135 

Ser Pro Gin Leu Leu Val Tyr Tyr 

150 

Pro Ser Arg Phe Ser Gly Ser Gly 

165 

He Asn Ser Leu Gin Pro Glu Asp 
180 185 
Phe Trp Ser Thr Pro Arg Thr Phe 

195 200 
Lys Arg 
2X0 

(2) INFORMATION FOR SEQ ID NO: 56 
(i) SEQUENCE CHARACTERISTICS: 
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Val Thr He Thr Cys Arg Ala Ser 

125 

Trp Tyr Gin Gin Lys Gin Gly Lys 
140 145 
Thr Thr Thr Leu Ala Asp Gly Val 

155 160 
Ser Gly Thr Gin Tyr Ser Leu Lys 
170 175 
Phe Gly Ser Tyr Tyr Cys Gin His 

190 

Gly Gly Gly Thr Lys Leu Glu He 

205 



(A) LENGTH: 211 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: 

Arg Phe Ser Gly Ser Gly Ser Gly 
1 5 
Ser Leu Gin Pro Glu Asp Phe Gly 

20 

Ser Thr Pro Arg Thr Phe Gly Gly 

35 40 
Ser Val Gly Glu Thr Val Thr lie 

50 55 
His Asri Tyr Leu Ala Trp Tyr Gin 
65 70 



SEQ ID NO 56: 

Thr Gin Tyr Ser Leu Lys lie Asn 

10 15 
Ser Tyr Thr Cys Gin His Phe Trp 
25 30 
Gly Thr Lys Leu Glu lie Lys Ala 

45 

Thr Cys Arg Ala Ser Gly Asn lie 

60 

Gin Lys Gin Gly Lys Ser Pro Gin 
75 80 



nw«rw*tr> ^wn 
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Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Gin 

85 90 95 

Val Gin Leu Lys Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gin Ser 

100 105 110 

Leu Ser lie Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly 

lis 120 125 

Val Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly 

130 135 140 

Met lie Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser 
145 150 155 160 

Arg Leu Ser lie Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lys 

165 170 175 

Met Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg 

180 185 190 

Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr 

195 200 205 

Val Ser Ser 
210 



(2) INFORMATION FOR SEQ ID NO: 57 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 163 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 57: 

Thr Asp Asp Thr Als Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg 
1 5 10 15 

Leu Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ala Pro Ser 

20 25 30 
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Gin Ser Leu Ser lie Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly 

35 40 45 

Tyr Gly Val Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp 

50 55 go 

Leu Gly Met He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu 
^5 70 75 80 

Lys Asp He Gin Met Thr Gin Ser He Thr Cys Arg Ala Ser Gly Asn 

85 90 95 

He His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro 

100 105 110 

Gin Leu Leu Val Tyr Try Thr Thr Thr Leu Ala Asp Gly Val Pro Ser 

H5 120 125 

Arg Asn Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Try Cys Gin His 

135 140 145 

Phe Trp Ser Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He 



ISO 



Lys Arg 



(2) INFORMATION FOR SEQ ID NO: 58 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 



(Xi) SEQXJENCE DESCRIPTION: SEQ ID NO 58: 



Arg Phe Thr He 
1 

Met Asp Ser Leu 

20 



Ser Arg Asn Asp 
5 

Arg Pro Glu Asp 



Ser Lys Asn Thr 
10 

Thr Gly Val Tyr 
25 



Leu Phe Leu Gin 
15 

Phe Cys Ala Arg 
30 
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Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys ,Phe Gly Pro Asp 

35 40 45 

Tyr Trp Gly Gin Gly Thr Pro Val Thr Val Ser Gin Pro Gly Arg Ser 

50 55 60 

Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe lie Phe Ser Ser Tyr Ala 

^5 70 '7C 

75 Qo 

Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala 

85 90 95 

He He Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val Lys 

100 105 110 

Gly Gin Ser Val Leu Thr Gin Pro Pro Ser Ala Ser Gly Thr Pro Gly 

120 125 
Gin Arg Val Thr He Ser Cys Ser Gly Thr Ser Ser Asn He Gly Ser 
"0 135 

Ser Thr Val Asn Trp Tyr Gin Gin Leu Pro Gly Met Ala Pro Lys Leu 
"5 150 155 160 

Leu He Tyr Arg Asp Ala Met Arg Pro Ser Gly Val Pro Asp Arg Phe 

1". 170 175 

Ser Gly Ser Lys Ser Gly Ala Ser Ala Ser Leu Ala He Gly Gly Leu 

180 185 190 

Gin. Ser Glu Asp Glu Thr Asp Tyr Tyr Cys Ala - Ala Trp Asp Val Ser 

193 200 205 

Leu Asn Ala Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly 
210 215 220 

Gin 
225 
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(2) INFORMATION FOR SEQ ID NO: 59 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 226 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 59: 



Arg Phe Ser Gly Ser Lys Ser Gly Ala Ser Ala Ser Leu Ala He Gly 

1 5 10 15 

Gly Leu Gin Ser Glu Asp Glu Tiir Asp Tyr Tyr Cys Ala Ala Trp Asp 

20 25 30 

Val Ser Leu Asn Ala Tyr Val Phe Gly Thr Gly Thr Lys Val Gly Thr 

35 40 , 45 

Pro Gly Gin Arg Val Thr He Ser Cys Ser Gly Thr Ser Ser Asn He 

50 55 ^0 

Gly Ser Ser Thr Val Asn Trp Tyr Gin Gin Leu Pro Gly Met Ala Pro 
^5 • 70 75 80 

Lys Leu Leu He Tyr Arg Asp Ala Met Arg Pro Ser Gly Val Pro Asp 

85 90 95 

Glu Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 

100 105 110 

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe He Phe Ser Ser Tyr 

120 125 130 

Ala Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 

135 140 145 

Ala He He Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val 
150 155 160 165 

Lys Gly Arg Phe Thr He Ser Arg Asn Asp Ser Lys Asn Thr Leu Phe 

170 175 ISO 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 

185 190 195 
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Ala Arg Asp Gly 
200 

•Pro Asp Tyr Trp 
215 



Gly His Gly Phe 

205 

Gly Gin Gly Thx 
220 



Cys Ser Ser Ala 

Pro Val Thr Val 

225 



Ser Cys Phe Gly 

210 

Ser 



(2) INFORMATION FOR SEQ ID NO: 60 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 224 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 60: 



Arg Leu Ser lie Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Phe Lys 

IS 10 15 

Met Asn Ser Leu Gin Ala Asn Asp Thr Ala lie Tyr Tyr Cys Ala Arg 

20 25 30 

Arg Ser Pro Ser Tyr Tyr Arg Tyr Asp Gly Ala Gly Pro Tyr Tyr Ala 

35 40 45 

Met Asp Tyr Trp Gly Gin Pro Ser Gin Ser Leu Ser lie Thx Cys Thr 

50 55 60 

Val Ser Gly Phe Ser Leu Thr Ser Tyr Gly Val His Trp Val Arg Gin 
65 70 80 85 

Ser Pro Gly Lys Gly Leu Glu Trp Leu Gly Val lie Trp Ser Gly Gly 

90 95 100 

Ser Thr Asp Tyr Asn Ala Ala Phe lie Ser Asp Val Leu Met Thr Gin 

105 110 115 

Gin Thr Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gin Ala Ser lie 

120 125 130 

Ser Cys Arg Ser Ser Gin Ser lie Val His Ser Asn Gly Asn Thr Tyr 

135 , 140 145 

Leu Glu Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu lie 
ISO 155 160 165 
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Tyr Lys Val Ser Asn 

170 

Ser Gly Ser Gly Thr 

185 

Glu Asp Leu Gly Val 
200 

Thr Phe Gly Gly Gly 
215 

(61) INFORMATION FOR 



- 118 - 

Arg Phe Ser Gly Val Pro 

175 

Asp Phe Thr Leu Lys lie 

190 

Tyr Tyr Cys Phe Gin Gly 
205 

Thr Lys Leu Glu lie Lys 
220 



SEQ ID NO: 61 
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Asp Arg Phe Ser Glv 

180 

Ser Arg Val Glu Ala 
195 

Ser His Val Pro Tyr 
210 



(i) SEQUENCE CHARACTERISTICS: 



(A) . LENGTH: 229 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQI3ENCE DESCRIPTION: SEQ ID NO 61: 



Ar-g Phe Ser Gly Ser 
1 5 
Arg Val Glu Ala Glu 

20 

His Val Pro Tyr Thr 
35 

Ser Leu Gly Asp Gin 
50 

Val His Ser Asn Gly 
65 

Gly Gin Ser Pro Lys 

85 

Gly Val Pro Asp Gin 

100 

Gin Pro Ser Gin Ser 
115 



Gly Ser Gly Thr Asp Phe 

10 

Asp Leu Gly Val Tyr Tyr 

25 

Phe Gly Gly Gly Thr Lys 
40 

Ala Ser lie Ser Cys Arg 
55 

Asn Thr Tyr Leu Glu Trp 
70 75 
Leu Leu lie Tyr Lys Val 

90 

Val Gin Leu Lys Gin Ser 

105 

Leu Ser lie Thr Cys Thr 
120 



Thr Leu Lys lie Ser 

15 

Cys Phe Gin Gly Ser 
30 

Leu Glu He Pro Val 
45 

Ser Ser Gin Ser He 
60 

Tyr Leu Gin Lys Pro 

80 

Ser Asn Arg Phe Ser 

. 95 

Gly Pro Gly Leu Val 
110 

Val Ser Gly Phe Ser 
125 
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Leu Thr Ser Tyr Gly Val His Trp Val Arg.Gln Ser Pro Gly Lys Gly 

135 .140 
Leu Glu Trp Leu Gly Val lie Trp Ser Gly Gly Ser Thr Asp Tyr Asn 
^« ISO 155 

Ala Ala Phe He Ser Arg Leu Ser He Ser Lys Asp Asn Ser Lys Ser 

1S5 

Gin Val Phe Phe Lys Met Asn Ser Leu Gin Ala Asn Asp Thr Ala He 

"0 185 190 

Tyr Tyr Cys Ala Arg Arg Ser Pro Ser Tyr Tyr Arg Tyr Asp Gly Ala 

200 20S 
Gly Pro Tyr Tyr Ala Met Asp Tyr Trp Gly Gin Gly Thr Ser Val 

220 225 
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Table la 

Cof-Cof distances for linker sections of MPC Fv fragment 
L55-84 and L104-107 versus Ll-21 Canonical numbering 

Distances up to 17A reported 



Light Chain 
2 3 4 5 6 



Light 

55 

56 

57 

58 

59 

60 

61 

62 

63 

64 

65 

66 
67 
68 
6$ 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 

eo 

81 
82 
83 
84 



8 9 10 



11 12 13 14 15 16 17 18 19 20 21 



17 17 17 
16 17 17 
16 16 14 15 16 17 
16 16 15 15 17 
17 17 14 14 14 15 17 
15 15 13 14 IS 17 

16 13 13 12 13 15 17 
15 11 11 9 

17 13 12 10 
15 13 10 
17 16 12 

14 



17 17 16 17 

15 14 15 14 15 16 

16 14 15 13 14 14 
17 17 15 15 12 11 11 9 10 12 

16 17 15 17 14 13 12 10 10 

17 17 16 16 14 11 11 
17 



16 13 11 
IS 13 



10 
9 
9 

10 



16 12 
16 



17 
17 
16 
14 
11 
11 
10 
11 
12 
15 
17 



10 


12 


14 


17 
























16 


14 


9 


10 


11 


14 


16 




















16 


12 


11 


10 


10 


11 


13 


15 


17 


17 














17 


14 


10 


9 


11 


9 


10 


11 


13 


15 


14 


17 


17 








16 


13 


10 


6 


6 


13 


11 


11 


12 


13 


14 


13 


15 


15 




16 


16 


14 


11 


8 


S 


6 


16 


14 


13 


13 


15 


14 


12 


14 


13 


15 


13 


13 


10 


7 


5 


4 


7 




16 


15 


14 


15 


15 


12 


13 


11 


13 


10 


10 


8 


6 


6 


7 


9 








17 




17 


14 


14 


12 


12 


10 


8 


7 


5 


7 


9 


12 












17 


14 


13 


10 


10 


6 


4 


5 


5 


8 


11 


14 








16 


17 


14 


11 


10 


8 


7 


4 


5 


5 


6 


8 


11 


13 












16 


13 


12 


10 


9 


5 


6 


8 


10 


11 


14 


17 












16 


13 


12 


10 


9 


7 


9 


11 


13 


14 


17 


17 












16 


14 


14 


13 


13 


10 


12 


14 


15 


15 


17 


17 






17 


16 


15 


13 


11 


12 


11 


11 


9 


11 


12 


12 


11 


13 


13 




17 


16 


15 


13 


11 


10 


10 


10 


11 


10 


13 


13 


14 


11 


14 


14 




14 


15 


14 


12 


10 


10 


12 


13 


15 


13 


16 


16 


15 


13 


14 


13 


17 


13 


11 


10 


9 


6 


5 


6 


7 


10 


10 


13 


12 


12 


9 


11 


10 




16 


14 


12 


11 


8 


6 


5 


6 


8 


9 


13 


12 


13 


12 


13 


13 






17 


15 


14 


11 


8 


6 


5 


5 


6 


10 


10 


12 


12 


14 


15 








16 


16 


12 


10 


6 


5 


5 


7 


11 


11 


13 


13 


16 


17 



104 

105 
106 
107 



(M17fi7RA? I > 
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Table lb 

Ca-Ca distances for linker sections of MPC Pv fragment 
H60-88 versus LI -21 Canonical numbering 

Distances up to 17A reported 
liight Chain 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 IB 19 20 21 

Heavy 

60 10 13 14 16 

61 10 14 15 17 

62 12 15 16 

63 15 

64 15 
65 

66 
67 
68 
69 

* 

70 

71 

72 

73 

74 

75 

76 

77 

78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
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Table le 

Cdf-Ccif distances for linker sections of MPC Fv fragment 

H60-88 and H108-113 versus Hl-21 Canonical numbering 

Distances up to 17A reported 
Heavy Chain 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 

Heavy 

60 17 
61 

52 16 16 17 16 17 17 

53 17 17 17 15 14 15 13 14 14 14 17 

54 17 16 17 15 16 16 16 
^5 16 14 15 13 15 15 17 

55 17 17 15 15 13 11 11 10 11 12 14 16 
67 16 16 14 15 17 17 15 16 14 13 12 10 10 10 11 13 

€8 16 16 15 16 17 17 15 14 11 11 9 lo 12 

S9 16 16 14 15 14 16 17 14 12 10 10 10 

"70 • 16 16 14 15 15 15 14 10 11 10 

71 17 16 14 14 13 14 15 15 12 n 9 

72 17 14 15 14 15 16 17 13 13 10 

73 17 16 15 16 16 17 16 ig 13 

74 17 17 16 17 17 

75 17 16 14 13 13 14 15 17 16 15 n 

76 15 12 11 10 11 12 14 16 I5 14 iq 

77 16 14 12 9 9 9 11 13 16 15 12 10 7 

78 17 15 13 10 10 9 11 12 15 17 14 10 9 6 

79 16 12 12 9 10 10 12 16 17 17 13 10 7 6 5 

80 14 13 10 10 10 11 15 16 15 17 17 16 14 10 8 6 5 6 

81 16 13 12 10 12 14 15 13 15 15 13 11 7 6 4 6 9 

82 14 13 11 11 13 13 11 12 12 10 8 5 5 6 8 11 

83 17 14 14 15 14 11 11 10 7 6 5 7 9 11 14 

84 16 15 15 13 10 9 7 4 6 6 9 12 14 

85 17 14 12 12 11 8 7 5 5 6 7 9 12 13 17 
8€ 17 15 14 12 9 8 5 7 9 10 12 15 16 

87 17 14 13 11 10 8 7 10 11 12 14 17 17 
8 8 15 14 13 12 12 10 12 13 14 15 17 17 

88 17 16 14 12 12 11 10 11 10 11 12 11 11 14 13 16 
88 16 15 13 10 9 9 10 11 11 13 13 12 12 14 13 16 
88 17 14 14 13 10 10 11 12 14 14 16 15 14 13 15 12 15 



108 16 13 10 9 8 5 6 9 8 14 15 17 15 14 11 12 10 12 

109 15 12 10 8 5 5 6 8 10 12 13 12 10 9 11 9 12 

110 15 14 11 6 6 5 7 9 10 12 11 11 11 14 13 le 

111 16 13 10 8 6 5 6 6 9 9 10 10 14 14 17 

112 16 12 10 6 6 5 6 10 10 12 13 17 17 

113 • 15 12 9 8 5 6 10 10 13 IS 



<wi?«>aA? I > 
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Table Id : Ca-Cof distances for linker sections of MPC Fv fragment 
LS5-84 versus HI- 21 Canonical numbering 

Distances up to 17A reported 
Heavy Chain 

1 2 3 4 5 e 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 

Light 

55 17 15 17 

56 16 15 17 

57 16 16 17 
58 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

81 

82 

83 • 
84 
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Table le 



Ca-Cof distances for linker sections of MPC Fv fragment 

Ii55-84 versus L104-107 H60-88 versus H108-113 Canonical 

numbering 

Distances up to 17A reported 



Light 
55 
56 
57 
58 
59 
60 
61 
62 
63 
64 



66 
67 
68 
69 
70 
71 



Light Chain 
104 105 106 107 



17 
17 

IS 
13 
16 



17 

16 



16 

16 



72 


15 








73 


12 


16 


17 




74 


12 


15 


15 




75 


10 


13 


13 


16 


76 


13 


15 


14 


16 


77 


12 


13 


12 


14 


78 


9 


10 


8 


11 


79 


. 10 


10 


8 


11 


60 


10 


9 


7 


10 


81 


10 


10 


9 


13 


82 


7 


8 


9 


12 


83 


5 


6 


7 


11 


84 


6 


8 


10 


14 



Heavy 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
86 
86 
88 
88 



Heavy Chain 
108 109 110 111 112 113 



17 



16 
15 



17 
16 



15 
15 



16 



15 
14 
16 
17 



17 

16 



14 
14 



17 



17 


17 










15 


15 










13 


12 


15 


16 






15 


13 


15 


14 






13 


10 


•12 


11 


15 


17 


16 


13 


14 


12 


14 


16 


17 


13 


13 


10 


12 


13 


14 


10 


10 


7 


9 


11 


13 


10 


9 


6 


8 


10 


14 


11 


7 


5 


6 


9 


13 


10 


9 


8 


10 


13 


10 


8 


7 


7 


10 


13 


7 


5 


5 


5 


8 


12 


6 


6 


7 


9 


12 


16 
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Table 


1£: Ca-Ca distances 


for 


iinJcer 


sections of MPC Fv fragment 








Intra 


Section relationships 








Canonical numbering 










Distances up to 17A reported 




















Light Chain 










Heavy Chain 






Liaht 


11 


12 13 


14 .15 16 17 18 19 20 


21 


1 Mea.w 


11 


12 


13 14 15 16 17 18 19 


20 


*5 1 


1 










1 1 












2 






• 




1 2 






• 




xo 


3 








16 


1 3 








16 




4 






16 


13 


1 4 






16 


13 




5 


15 




17 15 


11 


1 . 5 






16 14 


10 


8 


6 


13 


16 


15 11 


7 


1 ^ 


16 


17 


16 13 10 


7 


5 


7 


9 


13 15 


16 12 10 


6 


1 7 


13 


14 


17 14 10 9 


6 


5 


8 


7 


10 12 


16 17 16 15 11 9 


6 


1 8 


10 


11 


15 17 17 13 11 7 6 


5 


6 


9 


7 


10 13 


17 16 13 12 


6 


1 9 


8 


12 


14 15 12 10 7 8 7 


7 


9 


10 


4 


7 10 


14 15 15 15 12 12 


9 


1 ^0 


4 


7 


10 13 14 12 11 9 11 


10 


13 



Light Chain 



jight 


71 


72 


73 


74 


75 


76 


77 


78 


79 


80 


81 


82 


83 

• 


84 


55 


17 


16 


13 


15 


14 


16 










17 


16 




17 


56 






16 


17 


16 


17 












17 






57 






17 


17 


15 


16 


17 


17 


16 




15 


14 


17 


17 


58 




17 


13 


13 


11 


13 


13 


14 


13 


15 


12 


11 


14 


14 


59 




17 


13 


12 


9 


10 


10 


11 


11 


14 


11 


10 


14 


15 


60 




15 


12 


10 


9 


8 


10 


12 


13 


16 


14 


13 


16 


17 


61 


17 


15 


10 


8 


5 


5 


6 


8 


10 


13 


12 


10 


14 


15 


62 


13 


11 


7 


6 


5 


6 


9 


10 


12 


14 


13 


10 


13 


13 


63 


11 


8 


6 


5 


7 


8 


11 


13 


15 




16 


14 


16 


16 


64 


7 


5 


4 


6 


9 


11 


14 


15 








15 




17 


65 


6 


6 


7 


9 


12 


13 


17 
















66 


4 


6 


9 


11 


15 


17 


















67 


6 


9 


12 


15 






















68 


8 


12 


15 
























69 


7 


10 


14 


17 






















70 


4 


7 


11 


13 


17 





















t 
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Table If continued 



Heavy Heavy Chain 







77 


"7 n 
/ o 


7 Q 


o u 


O X 


o ^ 


o ^ 


a ^ 


O 9 


o o 


fl 7 


o o 


R R 


R R 


fl R 
D O 


b U 






17 


16 


13 


14 


13 


IS 


16 


15 


16 


17 


15 


13 


16 


15 


61 










16 


16 


14 


15 


15 


14 


15 


16 


13 


13 


17 


17 


62 










14 


14 


12 


13 


12 


11 


11 


13 


10 


9 


13 


13 


63 






17 


15 


11 


11 


9 


9 


10 


10 


11 


13 


11 


9 


13 


13 


64 






17 


15 


12 


12 


11 


11 


12 


13 


14 


17 


14 


13 


17 


17 


65 








16 


13 


11 


10 


9 


10 


12 


13 


17 


15 


14 






66 






17 


15 


10 


8 


6 


5 


6 


8 


11 


14 


13 


11 


15 


16 


67 




16 


13 


10 


7 


6 


5 


6 


8 


9 


12 


15 


13 


11 


14 


14 


68 


17 


14 


11 


8 


6 


5 


7 


8 


11 


12 


16 




17 


14 


17 


17 


69 


14 


11 


7 


6 


5 


6 


9 


11 


15 


15 








16 




17 


70 


11 


9 


6 


5 


7 


8 


12 


14 


















71 


7 


6 


4 


6 


9 


11 


14 


17 


















72 


6 


5 


6 


6 


11 


13 


17 




















73 


5 


7 


8 


IX 


14 


16 














• 








74 


6 


8 


11 


13 


17 
























75 


4 


6 


9 


11 


15 


17 
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DUPLEX OLIGONUCXEOTIDES NUMBER OF BASES IN 

TOP STRAND BOTTOM STRAND 



A 


SEQ 


ID 


No 


1 




SEQ 


ID 


No 


20 


25 


34 


B 


SEQ 


ID 


No 


2 


+ 


SEQ 


ID 


No 


19 


62 


63 


C 


SEQ 


ID 


No 


3 


+ 


SEQ 


ID 


No 


18 


59 


58 


D 


SEQ 


ID 


No 


4 


+ 


SEQ 


ID 


No 


17 


60 


61 


E 


SEQ 


ID 


No 


5 


+ 


SEQ 


ID 


No 


16 


61 


60 


F 


SEQ 


ID 


No 


6 


+ 


SEQ 


ID 


No 


15 


62 


53 


G 


SEQ 


ID 


No 


7 




SEQ 


ID 


No 


14 


61 


60 


K 


SEQ 


ID 


No 


8 


+ 


SEQ 


ID 


No 


13 


51 


62 


I 


SEQ 


ID 


No 


9. 


+ 


SEQ 


ID 


No 


12 


60 


59 


J 


SEQ 


ID 


Nolo 




SEQ 


ID 


No 


11 


52 


51 



I 

II 
III 



A + B '+ C + D 
E F + G 



H + I + J 



206 
184 
163 



216 
173 
172 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 



A. The indications made below relate to the mictoor^nism 
onpge jine 



to in tbe description 



pIGI 1079 



B. IDENTIFTCATION OF DEPOSIT 
Name of deposiuiy institution 



Funiier deposits m ideatiCed on an additional sheet 
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/n(ftan<£s for any other designated state'. * 



O. DESIGNATED 



FOR WmCH INDICATIONS ARE I^E (J/^i-diai.ia»x««««A,r.ff 




For receiving OHjce use only 



This sheet was received with the tntcmaUonal application 



Authorized oCGcer 



Form PCT/RO/134 (July 1992) 



For international Bureau use only 



[X] This sheet was received by the International Bureau on; 
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Authorized officer 



BNSDCXSID: <WO 



9412625A2 I 
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INDICATIONS RELATTNG TO A DEPOSITED MICROORGANISM 

(PCr Rule I3bis) 



The indicatiaiB nude below rebte to the ratcroor^niscn referred to in the 
on page ^ O ^ , line I 



on 



plCI Zbfa 



H. IDENTIFICATION OF DEPOSIT 



Ftmber deposits are identified on an additional sheet Q 



Name of depositary institution 



NATIONAL COLLECTION OF INDUSTRIAL AND MARINE BACTERIA 



Address of depositary institution fmeiuding postal epdcaadeauairy) 
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AcoBssion Number 
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C ADDITIONAL INDICATIONS (lax^ blank if act appiicable) Ibis information is continued on an additional sheet Q 



'in resx>ea, of alJ designated states in which such action is possibie and to 

the extern that ft is legally^ pennnssibieundertha law of the designated state. 
It is raqtf ested that a sample of the deposited niiao-o{ganism(s) be made 
available only by the issue thereof to an independent expea in acx»aJance 
with the relevant patent legislation, e.g. EPC Rule 28(4). UJC Rule 17(3). 
Australian Regulation 3.25(3) and generally similar provisions mutatis 
mutandis for any other designated state*. 



D, DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are fifraUiiaignaiedSiaia) 



E- SEPARATE FURNISHING OF INDICATIONS (IcayeUaniL if not appiicable) 



The indications listed below will besubmitted to the iniemational BuTe»uiiier(specifytiiegeneralnatureofii»einj{ietttionscg^ 'Accession 
Number of D^osMi*) 



For receiving Office use only 



This sheet was received^vith the tnternaiional appiicaiion 



Authorized officer 



Form PCT/RO/134 (July 1992) 



For intemationai Bureau use only 



Th s sheet was received by the iniemalionai Bureau on; 

0 2 OSCEMBRE 1993 



Authorized officer 
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CIAIMS 



1- A ligand binding variable domain (V-min) comprising: 

A) a framework region comprising: 

(a) a cyclically permuted central fi-barrel; 

(b) outer beta-sheet segments and; 

(c) linker sections; and 

5) complementarity determining regions (CDRs) ; 

whereby said central fi-barrel, optionally in combination with said 
outer fi-sheet segments, orients said CDRs in space for binding ligand; 
the linker sections provide linking between A) & B) to form the V-min, 
in which the framework region per se has upto about 180 amino acids. 

2. A V-min according to claim 1 wherein the central beta barrel 
has intact strands . 

3. A ligand binding variable domain (V-min) con^rising: 

(a) a cyclically permuted central fi-barrel havdLng intact strands; 

(b) outer beta-sheet segments; 

(c) complementarity determining regions (CDRs) ; and 

(d) linker sections for linking integers (a) , (b) & (c) into a V-min, 
whereby said central S-barrel, optionally in combination with said 
outer fi- sheet segments, orients said CDRs in space for binding ligsuid. 

4. A V-rain according to claim 3 wherein framework integers (a) , 
(b) and (d) per se have upto about 160 amino acids. 

5. A V-min according to any one of claims 1-4 wherein the CDRs 
substantially reproduce the spatial orientation of CDRs from a parent 
antibody and substantially retain the ligand binding specificity and 
affinity of the parent antibody. 

6. . . A V-min according to any one of claims 1-5 wherein the V-min 
is humanised. 
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7. A V-min according to any one of claims 1-6 wherein the 
V-min is in the form of one continuous polypeptide. 

8. A V-min according to any one of claims 1*7 comprising the 
following, canonically numbered, core segments: light chain residues 
21-55; light chain residues 84-104; heavy chain residues 21-60 and; 
heavy chain residues 88-lOd. 

9. A multivalent ligand binding moiety comprising at least one 
V-min as defined in any one of claims 1-8. 

10. An conjugate comprising a V-min as defined in any one of 
claims 1-8 and an effector molecule. 

11. A polynucleotide sequence encoding a V-min as defined in any 
one 0f claims 1-8. 

12 . An expression vector con^rising a polynucleotide sequence as 
defined in claim 11 . 

13 . A host cell comprising an expression vector as defined in 
claim 12 . 

14. A method of producing a V-rain comprising culture of a host 
cell as defined in claim 13 . 

15 . An immunotoxin pharmaceutical composition comprising a V-min 
as defined in any one of claims 1-8 in association with a 
pharmaceutical excipient or carrier. 

16 . A radioimaging composition comprising a V-min as defined in 
any one of claims 1-8 in association with a pharmaceutical excipient 
or carrier. 
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17. An antibody- enzyme pharmaceutical composition for ADEPT 
comprising a V-min as defined in any one of claims 1-8. 

18 . A method of treatment of a tumour conprising the 
administration to a human or animal in need of such treatment of a 
pharmaceutically effective amount of a pharmaceutical composition 
defined in any one of claims 15 or 17 . 

Id • Use of a radioimaging composition defined in claim 16 to 

produce a radioimage as an aid to diagnosis of a condition in a human 
or animal. 



9412625A2 I > 



wo 94/12625 



PCT/GB93/02375 



1/55 



Figure 1 Beta SfrT^ayif^ Arrangement of Tttmw^m^f Tiobuiir t vat-Ha ble Reaiong 



LIGHT CHAIN 



HEAVY CHAIN 



5 



19- 

I 

20 
I 

21- 

I 

22 
I 

C- 
I 

24 
I 

25 



76 61 



---75 



---73 



62 



74 63 



64 



66 



72 65 

I I 
71 

I 

70 
I 

69 



67 



68 



I 

18- 

I 

19 
I 

20- 



21 



5 23 

i I 



3 



24- 
I 

25 



82 



81 68 



80 



69 



79 70 



C 78 

I . I 
77 

I 

--76 
I 

75 



71 



72 



73 



74 



III 



L3 



97- 

I 

98 

I 

99- 



---90 



/ 



100 



I 



-c 

I 
I 

87 



101 



102- 



- — 86 



10 
1 

|. 
11 

12 

I 

13 



103 



104 84 



I 

105 

I 

106 

I 

107 



I 



Xi2 



33 50 



89 34 



W 

% 



49-- 

I 

48 



36 



37 45 



85 38 



52 
• 

-53 

•I 
54 

\ I 
* 47* I 

I '55 
46 I 

/ 

\ 

44 

I 

43 

/ 



39 42 



40 



I 

41 



I 



I 

102 94 

I 

103 

I 

104 — - 

/ 



105 



I 

10£ 



i 8 



\ 

107 



I 

C 

I 
I 

91 

I 
I 

90 



I I 
I I 
I 10 



11 

I I 
12' 



I 



108 

.'I 

' I 
109 

I 

'110 

' I 
111 

I 

'112 

I 



88 



95 33 



I 

34 — 

I 



93 



52 5S 

I I 
51 57 

I 1 
50 58 



I 

W — 

I ^ 

I 

37 



I 

--49 



89- 



38-- 

I 
I 

39 



--46 



\ 

48. 

I 

47 

/ 

\ 

45 

1 

44 

/ 



40 — 

I 

41 



--43 



42 



I 

59 

I 
I 

60 



Structurally important residues Cysteines (L23,Ii88; H22,H92) 
by C also Tryptophans Ij35 and H36 are represented by W. 
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The core segmentis are emboldened 
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Picmre 3 Beta Strand Arz-anaenent of Ttiwm^T^n frlobttlin Variable Regions 



The V-min Congtruct 4m 
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1 5- strands above paper 
: fi- strands behind paper 
. vertical linkers 
- horizontal linkers 
• # non- sequential links for v-min 

The emboldened sequence is the V-min construct. 
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FTGDRE 4 - ikLIGMMESIT OF FAB LIGHT CHMHS FOR P19 . 9 AND IMCP 



1 50 
imcpl DIVMTQSPSS LSVSAGERVT MSCKSSQSLL NSGNQKNFIA WYQQKFGQPP 

F1991 DIKMTQSPSS MYASLGBRVT ITCK ASODI NSYI iS WFQQKPGKSP 

<* 83 > < * LI«b • 84 44 

99 

Imcpl KLLIYGASTR ESGVPDRFTG SGSGTDFTLT ISSVQAEDIA VYYCQNDHS 
F1991 KTLI YRANR L VDGVPSRFSG SGSGQDYSLT ISSLEYEDMG lYYCL OYDE 

* Lll-a >• * < s5 • 93 

147 

imcpl YPLTFGAGT KLEIKRADA APTVSIFPPS SEQLTSGGAS WCFLNNFYP 
F1991 £EETFGGGT KLENKRADA APTVSIFPPS SEQLTSGGAS WCFLNNFYP 

LIII > 141 

195 

imcpl KDINVKWKID GSERQNGVLN SWTDQDSKDS TYSMSSTL. . TLTKDEYERH 
F1991 KDINAKWKID GSERQNGVLN SWTDQDSKDS TYSMSSTLSL TLTKDEYERH 

191 

220 

imcpl NSYTCEATHK TSTSPIVKSF NRNEC 
F1991 NSYTCEATHK TSTSPIVKSF NRNEC 

216 

The MCP and F199 sequences are listed in SEQ ID NOS: 39 and 40 
respectively. 
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PIGtTRE 4 CONTIUUJilD 



ALIGHMBMT OF FAB HEAVY CHAINS FOR F19 . 9 AND IMCP 



1 50 
Imcph EVKLVESGGG LVQPGGSLRL SCATSGFTFS DFmEWVRQP PGKRIiEWIAA 
F199h QVKLEESGGG LVQPGGSMKL SCAAS GFTFS B^WMDWVRQS PEK6LEWAE 

< Hl-a • S2 50 

100 

imcph SRNKGNKYTT EYSASVKGRF IVSRDTSQSI LYL(»1NALRA EDTAIYYCAR 
P199h I GNKGNNHA T YYAESVKGRF TVSRDTSKSR VYLQMNSLRV EDTGTYYCTT 

Hll-d > * < Si *100 



143 



imcph NYYG STWYFDVWGA GTTVTVSSES ARNPTIYPLT LPP.ALSSDP 

F199h R. . . FA YWGO GTLVTVSAAK TTPPSVYPLA PGSAAQTNSM 

Kill > 



136 
193 



imcph VIIGCLIHDY FPSGTT4NVTW GKSGKDITTV NFPPALASGG -RYTMSNQLTL 
F199h VTLGCLVKGY FP.EPVTVTW NSGSLSSGVH TFPAVLQSD. LYTLSSSVTV 



185 



222 



imcph PAVECPEGES VKCSVQH.DS NPVQELDVNC 
F199h PSSTWP.SET VTCNVAHPAS STKVDKKIVP RDC 



217 



Complementarity Determining Loops are underlined. 
Canonical Structure Type (reference 4} is indicated 
Framework residues important to Loop conf ozmatiions 



alphabetically 
indicated by 



Segments used in the V-min design are demarcated by < >. 

The order of appearance in the V-min sequence is indicated by 8#. 



The MCP and F199 sequences are listed as SEQ ID NOS. 41 and 42 
respectively. 
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PIGDRE 5 

(a) Aumyp ACID SEQUENCE FOR F19.9 V-MIN CONSTRUCTS 

QVQLQSS N-Terminal modifications to facilitate expression 
QVQLH 




TAG sec[uence EQKLISEEDLN* 



(b) AUTSQ ACID SgQUgWCg PGR IMCP V -min 

QPPGKai»EWIAAS HmC»WK XTOgg SASVDIVMTOSBISCgSSQSXJJISGHO 
lOmiAinrQQlCPGQPPKZiLr^GASTRSSGVPDRFTEDXJlVn 

TPGAGTiLL 

EQKIiXSEEDLN^ 



The F19.9 and IMCP sequences are listed as SEQ. ID NOS 43 and 44 
respectively. 
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FISORE 6 IITITXAL MPCLEOTIDE SEQUENCE FOR Fig. 9 V»inin 



ACTGGCAClTATTACTGTACCACGCGGTTTGCTTAerGGGGCCAAGGGACTCTC 

1 - -+ + + + 54 

TGACCGTGAATAATGACATGGTGCGCCAAACGAATGACCCCGGTTCCCTGAGAC 
TGTYYCTTRFAYWGQGTL 

GTCGGAGGATCCATGAAACTCTCiTGTGCTGCCrCTGGATTCACrriT 

55 + + 102 

CAGCCTCCTAGGTACirrGAGAGAACACGACGGAGACCTAAGTGAAA^ 
VGGSMKLSCAASGFTF 



AGTGACGCCTGGATGGACTGGGTCCGCCAGTCTCCAGAGAAGGGGCITGAGTGGGTTGC^^ 

103 + + + + + +- -162 

TCACTGCGGACCTACCTGACCCAGGCGGTCAGAGGTCTCri'CCCCGAACTCACCCAACGA 
SDAWMDWVRQS PE KGLEWVA 

• ■ • • ■ • 

GAAATTGGAAACAAAGGTAATAATCATGOUICGTACTATGCT 
163 ^. + + + + + 228 

CTTTAACCTTrGTTTCCATTATTAGTACGTTGCAT^^ 

EIGNKGNNHATYYAESVDIKMT 

• - • > • • 

CAGTCTACTATCACTTGCAAGGCGAGTCAGGACATTAATAGCT^^ 

229 -+ + + + + 288 

GTCJWjATGATAGTGAACGTTCCGCTCAGTCCTGTAATTATCGATA;^ 

QSTITCKASQDINSYLSWFQ 
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FIGURE 6 . CONTINUED 

• • • • • . 

CAGAAACCAGGGAAATCTCCTAAGACCCTGATCTACCGTGCAAACT^AT^ 

289 - + -I + - + + + 

GTCTTTGGTCCCTTTAGAGGATTCTGGGACTJ^^ 

QKPGKSPKTLIYRANRLVDG 

• • • • • • 

GXCCCATCAAGGTTCTCTGAAGATATGGGAATTTATTACT 

349 - + + + + + + 408 

CAGGGTAGTTCCAAGAGACITCTATACCCnTAAATAATAAC^^ 

VPSRPSEDMGIYYCLQYDEF 

• ■ ■ 

CCTCGGACGTTCGGTGGAGGCACCAAGCTG 

409 - + + + 438 

GGAGCCTGCAAGCCACCTCCGTGGTTCGAC 
PRTFGGGTKL 
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FIGURE 7 




RUBFiTmiTF .qWFPT 
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lO/SS 

FISDRB 8 




IMCP VMIN 



SUBSTJTUTE SHEET 
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FIGURE 9 




IMCP Fv IMCP Vmin 



SUBSTITUTE SHEET 
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FIGURE 10 




SUSSTITUTE SHEET 
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FIGURE lOA 



r 
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F19.9 VMEK LYS 97 ENVIRONlffiHT 



SUBSTITUTE SHEET. 



f 
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IMCP VMIN EirVTRONMECrPS FOR RESIDUES 
LYS 101 AND Gin 108 



SUBSTITUTE SHEET. 
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FIGURE 13 




I 107 



Yj p-Strand Arrangement 
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FIGURE it 



H3 HI H2 
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FIGURE 15 



FvP-Sandwich . 

H3 HI H2 
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FIGURE 16 



Con s erva. t ± vb 
Vmin Construct 
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FIGURE. IT 
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FIGURE 19 



Minimal 

Vmin Construct 
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FIG. 20 COMPLETE DNA SEQUENCE 01 McPC603 Ymin GENE 

SHOWING OLIGOWnCLEOTIDE SEQUENCES - 



10 



30 



ATCGATGAATTCCTGCAGCAIATGA 



50 



GTACTGCTATCTACTACTGTGCACGTAACT 



CATGTAGCTACTTAAGGACGTCGIATACTCATGA 



70 



90 



CGATAGATGATGACACGTGCATTGA 



110 



ACTATGGCTCCACTTGGTACTTCGACGTTTGG 



GGTGCTGGTACCACTGTAGGCGGCTCCC 



TGATACCGAGGTGAACCATGAAGCTGCAAACCCCACGA 



130 



150 



CCATGGXGACATCCGCCGAGGG 



170 



TCCGTCTGTCTTGCGCAACIAGTGGCTTCAC 



CTTCTCTGACTTCTACATGGAGTGGGTAC 



AGGCAGACAGAACGCGTTGATCACCGAAGTGGAAGA 



190 



210 



GACTGAAGATGTACCTCACCCATG 



230 



GCCAGCCGCCGGGTAAACGTCTCGAGTGGAT 



CGCAGCTAGCCGIAACAAAGGTAACAAGT 



CGGTCGGCGGCCCATTTGCAGAGCTCACCTAGCGTCG 



250 



270 



AICGGCATTGTTTCCATXGTTCA 



290 



ATACCACCGAATACAGCGCTTCT6TTGATATC 



GTCATGACCCAGTCTATGICTTGCAAAT 



TATGGTGGCTTATGTCGCGAAGACAACIATAGCAGIA 



310 330 
CTTCTCAGTCTCTGCTGAACTCTGGTAACCAGAA 



CTGGGTCAGAZACAGAACGTTTA 



350 



GAAGAGTCAGAGACGACTTGAGACCATTGG 



370 



AAACTTCCTCGCGTGGIATCAGCAAA 



TCTTTTTGAAGGAGCGCACCAIAGTCGm 



390 



AGCCTGGCCAGCCGCCGAAACTCCTGATCTACGGT 



TCGGACCGGTCGGCGGCITTGAGGAC7AGA 



430 



CGGACCGTTTCACCGAAGACCTGGCT 



450 



410 



GCGTCTAC6CGI6AAICTGGTGTTC 



TGCCACGCAGAIGCGCACIXAGACCACAAG 



470 



GTTTACIACTGTCAAAATGAICACICIIACCCGC 



GCCTGGCAAAGTGGCTTCTGGACCGACAAATG 



ATGACAGTTTIACTAGTGAGAATGGGCG 
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FIGURE 20 . CONTINUED 



490 



TGACCTTTGGCGCCGGCACCAAACTG 



510 



530 



GAACAGAAACTGATCTCTGAAGAAGACCTGAACT 



ACTGGAAACCGCGGCCGTGGTTTGACCTTGT 



550 



CTTTGACTAGAGACTTCTTCTGGACTTGA 



AATAATACGTAAGATCTG 



TIATTATGCATTCTAGACCTAG 
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FIG. 21 PEPTIDE SEQUENCE OF McPC603 Vain 
(including c-myc peptide i:ag) 

1 TAIYYCABNY YGSTWYFDVW GAGTTVGGSL RLSCATSGFT FSDFYHEffVR 

51 QPPGKRLE¥I AASRHKtaJKY TTEYSASVDI VMTQSMSCKS SQSLLNSGNQ 

101 KNFLAffYQQK PGQPPKLLIY GASTRESGVP DRFTEDLAVY YCQNDHSYPL 

151 TFGAGHXEg KLISEEDLN * * 

c-myc tag 
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SUBSTITUTE SHEET 
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FIGURE 2U 
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FIGURE 23 



EcoRI 

AATTCTCCCA MTATTCTCA AATCACCTCT TGACAATTAA TCATCCAACT 

ACTTAACTAC TACCCAACTT CACCTAAAAA CCCTATCCAC 

Kpnl BamHI Xbal Sail PstI SphI 
AATCCTACCC GCGGATrctTjACACTCa^ CTGCAGCCAT_GCAAGCTTAG 

Clal/AccI 
CGSGCCTAAT CAGCGCCCTT TTrTTTATCG AC 



I 



SUBSTITUTE SHEET 
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FIGURE 26 




Smg Smg BglH 
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••-•i^-ri-ri ixff QWFFT 
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rIGUEE 28 



TRANSOIIPTION TERMINATION SEQUENCE 



Sail 

5 ' TCGACATTATATTACTrAATTAATTGGGGACCCTAGAGGTCCC L 1 1 mT ATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



S£hl Hindlll 
AAAGCATGCA 
TTTCGTACGTTCGA 



SUBSTITUTE SHEET 
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(from pTB244-T4ter) 

UGATE 




SUBSTITUTE SHEET 



wo 94/12625 



35/55 



PCT/GB93/02375 



Styl / Avrll 
StuI 



PvuII 
BstXI 
Nrul 
5000 



5599 / EcoRI 
liuni 



Apal 
Sad I 
EcoRV 
Bbel 



Aatll 




Eel 1361 I 
SstI 
AccSSI 
Kpnl 



4000 



Mlcil 
Clal 
Msg I 



EcoiNI 



RsrII 



2000 
DrdI 



3000 
Csul 

Bsal 
Eoml 1.051 



I dure 



30 
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Fig. 31 



Aafn 
4000 



Beg I 
Seal 



Bgll 

Cfnoi 

Bsal 
Banf— I 



3000 




4160/EcoRI 



Pvul 
3500 




Afim 




pICI.1079 
4160 



ST 



or 



2000 



Sad 



BsfEH 
1500 
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Fig, 32 CONSTRUCTinN OF pirn 



al 



603 Vmin 



Scal/Xbal 
digest 



EcoRI 




Kpnl/T4/Xbal 
digest 




EcoRI 




PICI1572 



TetR 



ori 




603 Vmin 



cer T4term 



Xbal 
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FIG. 33 COMPLETE DNA SEQUENCE OF A 19.9 Vmin GEME 
SHOWING OLIGONUCLEOTIDE SEQDENCES 



10 



30 



ATCGATGAATTCCTGCAGCAZATGA 



50 



GTACTGGTACTTACTACTGCACTACTCGCT 



CCATGTAGCTACTTAAGGACGTCGTATACTCATGA 



70 



90 



CCAXGAATGATGACGTGATGAGCGA 



110 



TTGCATATTGGGGCCAAGGTACTCTGGTTG 



GTGGTTCCATGAAACTCTCTTGCGCTGCCT 



AACGTATAACCCCGGTTCCATGAGACCAACCACCA 



130 



150 



AGGTACTTTGAGAGAACGCGACGGA 



170 



CAGGCTTCACTTTTAGCGACGCCTGGATG 



GACTGGGTCCGTCAGTCTCCGGAGAAAGGTC 



I 



GTCCGAAGTGAAAATCGCTGCGGACCTACCTGACC 



190 



210 



CAGGCAGTCAGAGGCCTCTTTCCAG 



230 



TGGAATGGGTGGCCSAAATTGGCAACA 



AAGGIAACAACCACGCAACCTACTATGCTGAGT 



ACCTTACCCACCGGCTTTAACCGTTGTTTCC 



250 

■ 

CTGTTGATATCAA 



270 



ATTGTTGGTGCGTTGGATGATACGACTCA 



290 



AATGACCCAGTCTACTATCACGTGTAAGGCTAGCCAGGACATCAACA 



GACAACTATAGTTTIACTG 



310 



GGTCAGATGATAGTGCACATTCCGATCGGTCCTGTAGTTGT 



330 



350 



GCTACCTGAGCTGGTTCCAGCAGAAACCCGGGAAATCTCCGAAAACATTAATCTA 

I 



CCGTG 



CGAT 



GGACTCGACCAAGGTCGTCTTTGGGCCCTTTAGAGGCTTTTGTAATTAGATGGCAC 



I 



370 



390 



410 



CAAACCGTCTGGTTGATGGTGTACCCTCGAGGTTCTCTGAAGATATGGGTATTT 



ACTACT 



GTTTGGCAGACCAACTACCACATGGGAGCTCCAAGAGACTTCTATACCCATAAATGAT 



I 



GA 
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FIGDHE 33 CONTINU] 



430 



450 



470 



GTCTACAGTATGACGAGTTCCCGCGTACCTTCGGTGGCGGCACCAAGCTT 



GAACAGAAAC 



CAGATGTCATACTGCTCAAGGGCGCATGGAAGCCACCGCCGTGGTTCGAACTTGT 



CTTTG 



490 



510 



TGATCTCTGAAGAAGACCTGAACTAATAATACGTAAGATCTG 



ACTAGAGACTTCTTCTGGACTTGATTATTATGCATTCTAGACCTAG 
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yiG-3A PEPTIDE SEQUENCE OF 19.9 Vmin 
(inclnding c-myc peptide t:ag> 

1 TGTYYCTTRF AY¥GQGTLVG GSMKLSCAAS GFTFSDAffHD yVRQSPEKGL 
51 EWAEIGNKG NNHATYYAES VDIKKTQSTI TCKASQDINS YLS¥FQQKPG 
101 KSPKTLIYRA NRLVDGVPSR FSEDHGIYYC LQYDEFPRTF GGGTKLEQKL 

151 ISEBDUJ** 
c-myc tag 
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Figure 3 5 

1 ATG AAA TAG CTA TTG CCT ACG GCA 
Met Lys Tyr Leu Leu Pro Thr Ala 

4 9 GCC CAA CCA GCG ATG GCC CAG GTG 
Ala Gin Pro Ala Met Ala Gin Val 

97 CTG GTG GCG CCC TCA CAG AGC CTG 
Leu Val Ala Pro Ser Glxi Ser Leu 

145 TTG TCA TTA AGG GGG TAT GGT GTA 
Phe Ser Leu Thr Gly Tyr Gly Val 

193 AAG GGT GTG GAG TGG CTG GGA ATG 

Lys Gly Leu Glu Trp Leu Gly Met 

241 TAT AAT TCA GGT CTG AAA TGG AGA 
Tyr Asn Ser Ala Leu Lys Ser Arg 

289 AAG AGC CAA GTT TTG TTA AAA ATG 

Lys Ser Gin Val Phe Leu Lys Met 

337 GCC AGG TAG TAG TGT GCC AGA GAG 
Ala Arg Tyr Tyr Cys Ala Arg Glu 

385 GGC CAA GGG AGC ACG GTG AGC GTG 

Gly Gin Gly Thr Thr Val Thr Val 
433 GAG GTT GGA TGG AAA TTG TAT TTG 



481 CTA TTG GGT AGG GCA GCG GGT GGA 

Leu Leu Pro Thr Ala Ala Ala Gly 

529 GCG ATG GGC GAG ATG GAG GTG ACG 

Ala Met Ala Asp lie Glu Leu Thr 

577 TCT GTG GGA GAA ACT GTG AGG ATG 
Ser Val Gly Glu Thr Val Thr lie 

625 GAG AAT TAT TTA GCA TGG TAT GAG 
His Asn Tyr Leu Ala Trp Tyr Gin 

673 CTG CTG GTG TAT TAT ACA AGA AGG 
Leu Leu Val Tyr Tyr Thr Thr Thr 

721 TTG AGT GGC AGT GGA TCA GGA ACA 
Phe^Ser Gly Ser Gly Ser Gly Thr 

76 9 CTG CAA GGT GAA GAT TTT GGG AGT 
Leu Gin Pro Glu Asp Phe Gly Ser 

817 ACT GGT GGG ACG TTG GGT GGA GGG 
Thr Pro Arg Thr Phe Gly Gly Gly 



GCC GGT GGA TTG TTA TTA CTG GGT 48 

Ala Ala Gly Leu Leii Leu Leu Ala 

CAG CTG CAG GAG" TCA GGA CCT GGC 96 
Gin Leu Gin Glu Ser Gly Pro Gly 

TCC ATG ACA TGG ACG GTG TCA GGG 144 
Ser lie Thr Cys Thr Val Ser Gly 

AAG TGG GTT GGC GAG CCT GCA - GGA 192 
Asn Trp Val Arg Gin Pro Pro Gly 

ATT TGG GGT GAT GGA AAG ACA GAG 240 

lie Trp Gly Asp Gly Asn Thr Asp 

CTG AGC ATG AGC AAG GAG AAG TGG 288 
Leu Ser lie Ser Lys Asp Asn Ser 

AAG AGT GTG CAG ACT GAT GAG AGA 336 

Asn Ser Leu His Thr Asp Asp Thr 

AGA GAT TAT AGG CTT GAG TAG TGG 3 84 

Arg Asp Tyr Arg Leu Asp Tyr Tirp 

TGG TCA TAA TAA GAG CTA TCC GGG 432 

Ser Ser 

AAG GAG ACA GTG ATA ATG AAA TAG 480 

Met Lys Tyr 

TTG TTA TTA CTG GCT GCC CAA GGA 528 

Leu Leu Leu Leu Ala Ala Gin Pro 

GAG TCT GCA GCC TCC CTT TCT GCG 576 

Gin Ser Pro Ala Ser Leu Ser Ala 

AGA TGT GGA GCA AGT GGG AAT ATT 624 
Thr Cys Arg Ala Ser Gly Asn lie 

CAG AAA CAG GGA AAA TCT CCT CAG 672 

Gin Lys Gin Gly Lys Ser Pro Gin 

TTA GCA GAT GGT GTG GCA TCA AGG 720 
Leu Ala Asp Gly Val Pro Ser Arg 

CAA TAT TCT CTG AAG ATG AAC AGC 768 
Gin Tyr Ser Leu Lys lie Asn Ser 

TAT TAG TGT CAA CAT TTT TGG AGT 816 

Tyr Tyr Cys Gin His Phe Trp Ser 

ACG AAG CTG GAG ATG AAA GGG GAA 864 

Thr Lys Leu Glu lie Lys Arg Glu 
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Figure 3 5 f continued) 

865 CAA AAA CTC ATC TCA GAA GAG GAT CTG AAT TAA TAA TGA TCA AAC GGT 
Gin Lys Leu lie Ser Glu Glu Asp Leu Asn 

913 AAT AAG GAT CCA GCT CGA ATT CAC TGG CCG TCG TTT TA 950 
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Figure 3^ 
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Figure 3^ 
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Ficnire 40 D1.3 V-min conservative 



MKYLLPTAAAGLLLLAAQPAMA pel B secretion leader 

SRLSISKDNSKSQVFLKMNSLHTDDTARYYCARERDY 
h65 0 0 0 0.. 

RLDYWGQGTTLTVS APSQSLSITCTVSGFSLTGYGV 
0. . .S. . .hllS hl3 0 0 : 

NWVRQPPGKGLEWLGMIWGDGNTDYNSAIiK DIQMTQ 
0 0 0 A-h64 11 

S PASLS ASVGETVT I T CRAS GN I HNYLAWYQQKQGKS 
* 0 0 0 0 

PQLLVYYTTTLADGVPSRFSGSGSGTQYS liKINSLQP 
0 0 0 0 

E D F G S Y Y.C QHFWSTPRTFGGGTKLEIKR 

0 ; 0 1108 



EQKLISEEDLN tag sequence 
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Figure 41 

a) h65-115/hl3-64/ll-108 Intact Light Chain 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuIieiiLeiiLeiiAlaAlaGlnProAlaMetAla. 
pel B secretion leader 

AspTyrProGlySerSerGly Secretion facilitator 

SerArgLeuSerlleSerLysAspAsnSerLysSerGlnValPheLeuLysMetAsnSerLeuHisThrAspAspThr 

ties 70 80 90 

AlaArgTyrTyrCysAlaArgGluArgAspTyrArgLeiiAspTyrTr^ 

100 110 . .hllS 

AlaProSerGlnSerLeuSerlleThrCysThrValSerGlyPheSerLeuThrGlyTyrGlyValAsnTrpValArg 
ill3 20 30 




50 60 h64 



AspIleGlnMetThrGlnSerProAlaSerLeuSerAlaSerValGlyGluThrValThrlleThrCysArgAlaSer 
11 10 20 

GlyAsnlleHisAsnTyrLeuAlaTrpTyrGlnGlnLysGinGlyLysSerProGlnLeiiLeuValTyrTyrT^ 
30 40 5Q 

ThrLeuAlaAspGlyValProSerArgPheSerGlySerGlySerGlyThrGlnTyrSerLeiiLyslleAsnSerLeu 

SO 70 

GlnProGluAspPheGlySerTyrTyrCysGlnHisPheTrpSerThrProArgThrPheGlyGlyGlyT 
• • • -80 90 100. 

GluIleliysArg. ... 
1108 

GluGlnLysLeuIleSerGluGluAspLexiAsn tag sequence 
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Figure 42 

b) 161-107/li3-60/hl-116 Intact Heavy Chain D1.3 

MetLysTyrLeiilieuProThrAlaAlaAlaGlyLeuLeuLeiiLeuAlaAlaGlnProAlaMetAla. . . . 
pel B secretion leader 

GlySerSerSerGly Secretion facilitator 

ArgPheSerGlySerGlySerGlyThrGlnTyrSerLeuLysIleAsnSerLeuGlnProGluAspPheGlySerTyr 
161 70 80 

TyrCysGlnHisPheTrpSerThrProArgThrPheGlyGlyGlyThrliysLeuGluIleLys .... 
90 100 1107 

AlaSerValGlyGluThrValThrlleThrCysArgAlaSerGlyAsnlleHisAsnTyxLeiiAlaTrpTyrGlnGln 
113 20 30 , . . 

LysGlnGlyLysSerProGlnLeiilieuValTyrTyrThrThrThrLexaAlaAspGlyValProSer. . . , 
- • • -40 50 160 

GlnValGlnLeiiliysGluSerGlyProGlyLeuValAlaProSerGlnSerLeiiSerXleThrCysThrValSerGly 

hi 10 ; 20 

PheSerLeuThrGlyTyrGlyValAsnTrpValArgGlnProProGlyLysGlyLeuGluTrpX^euGlyMetrieTrp 
30 40 50. . ! . . . 

GlyAspGlyAsnThrAspTyrAsnSerAlcLLeuLysSerArgLexoSerlleSerLysAspAsnSerliysSerGlnVal 
60 70. 

PheJLeuLysMetAsnSerleiiHisThrAspAspThrAlaArgTyrTyrCysAlaArgGliiArg;^ 

• - • . 80 90 100 

Tyr TrpGlyGlnGlyThrThrlieuThrValSer S er 
110. hll6 

GluGlnliysLeuIleSerGluGluAspIieuAsn tag secjuence 



wo 94/12625 



PCT/GB93/0237S 



50/55 



Fierure 43 



c) h87-ii5/hl3-S4/ll-7/121-61/17S-108 Minimal Construct Dl.3 

MetLysTyrLeuLeuProThrMaAlaAlaGlyLeuLeuIieuLeuAJaAlaGlnProA^ 
pel B secretion leader 



ThrAspAspThrAlaArgTyrTyrCysAlaArgGluArgAspTyrArgLeuAspT^ 

90 100 110 

ThrValSer . . . 
tills 

AlaProSerGlnSerJ 

ills 20 30 




GlnProProGlyLysGlyLeuGluTrpieuGlyMetlleTrpGlyAspGlyAsnThrAspTyrAsnSerAlaLeuLy^ 
• • • '^^ -50 go h64 

AspIleGlnMetThrGlnSer. . .IleThrCysArgAlaSerGlyAsnlleHisAsiiTyrLeuAlaTrpTyrGlnGln 
11 7. ..121 30... 




. . . . 40 . . ; 50 



ISI 



AsnSerlieuGlnProGliiAspPheGlySerTyrTyrCysGlnHisPheTrpSerThrProArgThrPhe^^ 

^"^^ SO 90 100... 

ThrLysLieuGlulleliysArg 
1108 

GluGlnLysLeuIleSerGluGluAspLeuAsn tag sequence 
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Figure 44 

a) h67-l25/hl3-66/ll-ii2 Intact Light Chain KOL 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeiiLeiiLeiiLeuAlaAlaGlnProAlaMetAla. . . . 
pel B secretion leader 

GlySerSerSerGly Secretion facilitator 

ArgPheThrlleSerArgAsnAspSerLysAsnThrLeuPhelieuGlnMetAspSerLeuArgProGliiAspThrGly 
h67 70 80 90 

ValTyrPheCysAlaArgAspGlyGlyHisGlyPheCysSerSerAlaSerCysPheGlyProAspTyrTrpGlyGln 
100 110 

GlyThrProValThrValSer . . . GlnProGlyArgSerLeuArgLeuSerCysSerSerSerGlyPhellePheSer 
. . .120 hl25 hl3 20 30 

SerTyrAlaMetTyrTrpValArgGlnAlaProGlyLysGlyLeuGluTrpValAlallelleTrpAspAspGlySer 
40 50 

AspGlnHisTyrAlaAspSerValljysGly . . . GlnSerValLeuThrGlnProProSerAlaSerGlyThrProGly 
60 h66 11 10 

GlnArgValThrlleSerCysSerGlyThrSerSerAsnlleGlySerSerThrValAsnTrpTyrGlnGlnleuPro 
20 30 40. . . 

GlyMetAlaProLysIjeuIieuIleTyrArgAspAlaMetArgProSerGlyValProAspArgPheSerGlySerljys 
50 60 

SerGlyAlaSerAlaSerLeiiAlalleGlyGlyljeuGlnSerGliiAspGluThrAspTyrTyrCysAlaAlaTrp;^ 

70 80 90 

ValSerLeuAsnAlaTyrValPheGlyThrGlyThrLiysValThrValLeuGlyGln 
100 110. .1112 



GluGlnLysLeuIleSerGluGluAspIieiiAsn tag sequence 
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FicTure 45 

b) 162-109/ll2-61/hl-125 Intact Heavy Chain KOL 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuJjeuIjexilieuAlaAlaGlnProAlaMet;^ 
pel B secretion leader 



GlySerSerSerGly Secretion facilitator 



ArgPhei 

162 70 80 




TyrCysAlaAlaTrpAspVaiSerLeiiAsnAlaTyrValPheGlyThrGlyThrLysVal . . . GlyThrProGlyGln 
50 100 1109 112 

ArgValThrlleSerCysSerGlyThrSerSerAsnlleGlySerSerThrValAsnTrpTyrGlnGlnLeuProGly 
20 30 40 

MetAlaProLysLeuLeuIleTyrArgAspAlaMetArgProSerGlyValProAsp. . . GluValGlnLeuValGln 
50 161 hi 

SerGlyGlyGlyValValGliiProGlyArgSerLeiiArgLeuSerCysSerSerSerGlyPhellePheSerSer^ 
3.0 20 30 

AlaMetTyrTrpValArgGlnAlaProGlyLysGlyLeuGluTrpValAlallelleTrpAspAspGlySerAspGln 
40 50 

HisTyrAlaAspSerValliysGlyArgPheThrlleSerArgAsnAspSerl-ysAsnThrLeuPhelieuGlnM 
• • • • 60 70 . . . 80 

SerLeuArgProGliiAspThrGlyValTyrPheCysAlaArgAspGlyGlyHisGlyPheCysSerSerAlaSerCys 
90 100 110 

PheGlyProAspTyrTrpGlyGlnGlyThrProValThrValSer 
.-120 hl2S 

GluGlnliysljeuXleSerGluGluAspLieuAsn tag sequence 
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Ficmre 4 6 

a) h66-ll8/hl3-65/ll-ll2 Intact Light Chain PAC-1 

MetLysTyrLeuLeiiProThrAlaAlaAlaGlyLeuLexiLeiiLeiiAlaAlaGlnProAlaMet;^ 
pel B secretion leader 

GlySerSerSerGly Secretion facilitator 




^" 80 ..:t:9o^.^ 

IleTyrTyrC^sAlaArgArgSerProSerTyrTyrArgTyrAspGlyAlaGlyProT^^ 

100 110 

Gly . . . GlnProSerGlnSerlieuSerlleThrCysThrValSerGlyPheSerlieuThrSerTyrGlyValHisTm 
hue hl3 20 30 

ValArgGlnSerProGlyLysGlyLeuGluTrpLeuGlyVallleTrpSerGlyGlySerThrAspTyrAsnAlaAla 
40 50 60 

PhelleSer, . . AspVallieuMetThrGlnGlnThrProLeuSerLexaProValSerLeuGlyAspGlnAlaSerlle 
^" 10... 20... 

SerCysArgSerSerGlnSerlleValHisSerAsnGlyAsnThrTyrLeuGluTrpTyrLeuGlnLysP 

30 ; 40 

SerProLysLeuLieuIleTyrliysValSerAsnArgPheSerGlyValProAspArgPheSerGlySerGlySerGly 
50 60 

ThrAspPheThrl^euLys 1 1 eS er ArgValGliiAlaGluAspLeuGly Val Tyr Tyr Cy s PheGlnGlyS erH^ 
80 90 

ProTyrThrPheGlyGlyGlyThrLyslieuGluIleLys 
110. .1112 

GluGlnLysLeuIleSerGluGlxiAspLeuAsn tag 
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Figure 47 

b) 166-lli/li2-65/hl-123 Intact Heavy Chain PAC-1 

MetLysTyrLeuLeuProThxAlaAlaAlaGlyLe\jIieuLeuLeuAlaAlaGiiiProAlaM 
pel B secretion leader 



GlySerSerSerGly Secretion facilitator 



ArgPheSerGlySerGlySerGlyThrAspPheThrLeiiliysIleSerArgValGliiAlaGluAspLeuGlyValTyr 
70, 80..: 90... 

TyrCysPheGlnGlySerHisValProTyrThrPheGlyGlyGlyThrLysLeuGluIle . . . ProValSerLeuGly 
• 100 ; 1111 112 

AspGlnAlaSerlleSerCysArgSerSerGlnSerlleValHisSerAsnGlyAsnThrTyrLeuGluTrpTyrLeu 
20 30 40 

GlnLysProGlyGlnSerProLysLeiiLeuIleTyrLysValSerAsnArgPheSerGlyValProAsp . . .GlnVal 
50 60 165 hi 

GlnlieuLysGlnSerGlyProGlyLeuVcacinProSerGlnSerLeuSerlleThrCysThrValSerGlyPheSer 
10 20 

LeuThrSerTyrGlyValHisTrpValArgGlnSerProGlyLysGlyLeuGluTrpLieuGlyVallleTrpSerGly 
• • • • 30 40 • 50 

GlySerThrAspTyrAsnAlaAlaPhelleSerArgLeuSerlleSerLysAspAsnSerLysSerGlnValPhePhe 

60 70. . 80 

LysMetAsnSerlieuGliiAlaAsnAspThrAlalleTyrTyrCysAlaArgArgSerProSerTyr^ 
, . .90 XOO. 

GlyAlaGlyProTyrTyrAlaMetAspTyxTrpGlyGlnGlyThrSerVal 
110 120 hl23 

GluGlnLysLeuIleSerGluGluAspLeuAsn tag sequence 



wo 94/12625 



55/55 



PCT/GB93/0237S 



MOLECULAR DEVICES 
Raw Data (Plate) 



DATA FILE: 
DESCRIPTION: 
PROTOCOL: 
DESCRIPTION: 
MODE: 
WAVELENGTH: 
MEAN TEMP: 



Vcon assay 3.9.93 
VCON ASSAY 3.9.93 



Endpoint 
490 

22.50»C 



AUTOMIX: ON 



PRINTED: 10/11/93 



CALIBRATION: ON 
SET TEMP: OFF 



Optical Density 

5 6 7 8 



10 11 



1 2 



3.856 


3.892 


3.706 


■ 

3.237 


2.697 


1.904 


1.214 


0.650 


0.417 


0.293 


0.255 


0.215 


3.946 


3.865 


3.724 


3.300 


2.716 


1.981 


1.196 


0.701 


0.408 


0.298 


0.242 


0.220 


0.195 


0.180 


3.940 


0.165 


0.165 


0.168 


0.160 


• 

0.163 


0.163 


0.172 


0.174 


0.168 


0.168, 


0.175 


0.180 


0.174 


0.165 


0.177 


0.164 


0.166 


0.174 


0.172 


0.189 


0.175 


0.172 


0.167 


0.167 


0.161 


0.165 


0.168 


0.159 


0.161 


0.167 


i 0.195 


0.177 


0.164 


0.199 


0.178 


3.978 

« 


0.167 


0.175 


0.540 


0.172 

4 


0.180 


0.176 


0.173 


0.171 


0.170 


0.173 


0.181 


0.174 


0.398 


0.167 


0.178 

■ 


0.174 


0.180 


0.171 


0.174 


0.163 


0.171 


0.171 

• 


0.180 


0.171 


0.174 


0.176 


0.174 


0.180 


0.185 


0,180 


0.201 


0.180 


0.158 



A 
B 
C 
D 
E 
F 
G 
H 



FIGURE U8 



« 



